IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

DECLARATION OF JOHN C. ROCKETT, Ph.D. 
UNDER 37 C.F.R. § 1.132 

I, JOHN COUGHLIN ROCKETT III, Ph.D., declare and 
follows : 

1. Since 1995 I have been engaged full-time in 
molecular toxicology research, with an emphasis on the 
application of expression profiling techniques, including but 
not limited to nucleic acid microarray expression profiling 
techniques, to studies of the mechanisms of toxicant action 
and to the design of assays to monitor toxicant exposure. 

2. My curriculum vitae, including my list of 
publications, is attached hereto as Exhibit A. 

3. For the past 5 years, my work has focused 
primarily on analyzing the effects of potentially hazardous 
environmental agents, such as heat, water disinfectant 
byproducts, and conazole fungicides on the male reproductive 
tract. Although we are interested in the basic mechanisms of 
action of such toxicants, we also have two practical goals in 
mind: first, to identify individual agents and families of 
agents that adversely affect male reproductive development and 
function, and second, to develop methods for monitoring human 
exposure to such agents, particularly methods capable of 
identifying toxicant exposure at an early stage. 




state as 



4 . I have relied on expression profiling as a 
principal approach to these goals. Expression profiling, by 



reporting the expression levels of thousands of genes 
simultaneously, gives us an opportunity to identify and group 
toxicants based on similarities in the patterns of gene 
expression they induce in cells and tissues; the gene 
expression profiles induced by treatment with known testicular 
toxins serve as standards, molecular signatures or molecular 
fingerprints as it were, against which the patterns of gene 
expression induced by agents of unknown toxicity may be 
compared and judged. In addition, gene expression profiling 
may give us the opportunity to detect toxicity before more 
gross phenotypic changes become manifest. 

5. In keeping with this research emphasis, I have 
until recently: 

served on the Microarray Technical 
Subcommittee of the United States Environmental 
Protection Agency (EPA) Genomics Task Force, and 

served on the Scientific Committee for 
the conference series on "Critical Assessment of 
Techniques for Microarray Data Analysis," held 
annually at Duke University, Durham, NC; 



and I currently 



serve on the Technical Committee on the 
Application of Genomics to Mechanism-Based Risk 
Assessment of the International Life Sciences 
Institute's Health and Environmental Sciences 
Institute, 

serve on the Genomics and Proteomics 
Committee of the National Health and Environmental 
Effects Research Laboratory of the EPA's Office of 
Research and Development, 

belong to the [North Carolina Research] 
Triangle Array Users Group, 



belong to the Molecular Biology 
Speciality Section of the Society of Toxicology, 
and 

belong to the Triangle Consortium for 
Reproductive Biology, 

In addition, I am the principal investigator on a cooperative 
research and development agreement (CRADA) entitled 
"Development of a Genetic Test for Male Factor Infertility." 
Prior to this, I was a co-principal investigator on a 
materials cooperative research and development agreement 
(MCRADA) to print oligonucleotide-based microarrays; and from 
1999 - 2002, I was coinvestigator on a CRADA to develop gene 
microarrays for toxicology applications, 

6. I presume the reader's familiarity with the 
basic construction and operation of microarrays. For purposes 
of the discussion to follow, I use the phrase "nucleic acid 
microarray" and, equivalently , the term "microarray" to refer 
generically to the various types of nucleic acid microarray 
that include immobilized nucleic acid probes of sufficient 
length to permit specific binding, with minimal cross- 
hybridization, to the probe's cognate transcript, whether the 
transcript is in the form of RNA or DNA. Although this 
definition excludes microarrays having shorter probes, such as 
the 20-mer probes of arrays manufactured by Affymetrix, Inc., 
many of the comments that follow nonetheless apply to such 
microarrays as well. 

7. Although my own work with microarrays dates 
back only to 1998, and high density spotted nucleic acid 



microarrays themselves date back perhaps only to 1995, 1 
microarrays are by no means the only, nor the first, 
expression profiling tool. As I describe in detail in my 
Xenobiotica review, 3 there are a number of other differential 
expression analysis technologies that precede the development 
of microarrays, some by decades, and that have been applied to 
drug metabolism and toxicology research, including: 
(1) differential screening; (2) subtractive hybridization, 
including variants such as chemical cross-linking subtraction, 
suppression-PCR subtractive hybridization and representational 
difference analysis; (3) differential display; (4) restriction 
endonuclease facilitated analyses, including serial analysis 
of gene expression (SAGE) and gene expression fingerprinting; 
and (5) EST analysis. 

8. In my own earlier research, I used both 
reverse- transcriptase polymerase chain reaction (RT-PCR) and 
suppression-PCR subtractive hybridization (SSH) to study 
patterns of differential gene expression caused by hepatic 
challenge with nongenotoxic and genotoxic hepatotoxins. 3 



Schena et al., Quantitative monitoring of gene expression patterns 
with a complementary DNA microarray, ■ Science 270:467-470 (1995) attached 
hereto as Exhibit B. ' 

Rockett et al., -Differential gene expression in drug metabolism and 
toxicology: practicalities, problems and potential," Xenobiotica 29-655-691 
(1999) (hereinafter, -Xenobiotica review-), attached hereto as Exhibit C. 

See, e.g., Rockett et al., -Molecular profiling of non-genotoxic 
carcinogenesis using differential display reverse transcription polymerase 
chain reaction (ddRT-PCR) , • European J. Drug Metabolism & Pharmacokinetics 
22(4):329-33 (1997), and Rockett et al., -use of a suppression-PCR 
subtractive hybridization method to identify gene species which demonstrate 
altered expression in male rat and guinea pig livers following 3-day 
exposure to [4-chloro-6- (2, 3-xylidino) -2-pyrimidinylthio] acetic acid • 
Toxicology 144 (1-3) :13-29 (2000), attached hereto respectively as Exhibits 
D and E. 



9. These older transcript expression profiling 
techniques provide analogous expression data, but with far 
lower throughput. 

10. It has been well-established, at least since 
the introduction of high density spotted microarrays in 1995, 
that: 



(i) each probe on the microarray, with 
careful design and sufficient length, and with 
sufficiently stringent hybridization and wash 
conditions, binds specifically and with minimal 
cross-hybridization, to the probe's cognate 
transcript; 

(ii) each additional probe makes an 
additional transcript newly detectable by the 
microarray, increasing the detection range, and 
thus versatility, of this analytical device for 
gene expression profiling; 4 

(iii) it is not necessary that the 
biological function be known in order for the gene, 



The compelling logic of this proposition has likely motivated the 
remarkably rapid progress from the earliest high density spotted arrays in 
1995 (Schena et al., "Quantitative monitoring of gene expression patterns 
with a complementary DNA microarray," Science 270:467-470 (1995), attached 
hereto as Exhibit B) , to the first whole genome arrays in 1997 (Lashkari et 
al., "Yeast microarrays for genome wide parallel genetic and gene 
expression analysis," Proc. Natl. Acad. Sci. USA 94 (24 ): 13057-62 (1997) and 
DeRisi et al., "Exploring the metabolic and genetic control of gene 
expression on a genomic scale," Science 278 (5338) : 680-6 (1997), attached 
hereto as Exhibits F and G, respectively), to the concurrent announcement 
by two companies earlier this month of their respective commercial 
introductions of single chip human whole genome arrays (Pollack, "Human 
Genome Placed on Chip; Biotech Rivals Put it Up for Sale," The New York 
Times, Thursday, October 2, 2003 (Business Day), attached hereto as 
Exhibit H; "Agilent Technologies ships whole human genome on single 
microarray to gene expression customers for evaluation, " Press Release, 
Agilent Technologies, October 2, 2003, attached hereto as Exhibit I; 
"Af fymetrix Announces Commercial Launch of Single Array for Human Genome 
Expression Analysis; More Than 1 Million Probes Analyze Expression Levels 
of N arly 50,000 RNA Transcripts and Variants on a Single Array the Size of 
a Thumbnail," Press Release, Affymetrix, October 2, 2003, attached hereto 
as Exhibit J) . 



or a fragment of the gene, to prove useful as a 
prob on a microarray to be used for expression 
analysis; 

(iv) failure of a probe to detect changes 
in expression of its cognate gene does not diminish 
the usefulness of the probe on the microarray; and 

(iv) failure of a probe to detect a 
particular transcript in any single experiment does 
not deprive the probe of usefulness to the 
community of users who would use this research 
tool . 

These principles also apply to transcript expression profiling 
techniques that antedate the development of high density 
spotted microarrays, and accordingly were well-understood 
prior to 1995. 

11. Moreover, expression profiling is not limited 
to the measurement of mRNA transcript levels. It is widely 
understood among molecular and cellular biologists that 
protein expression levels provide complementary profiles for 
any given cell and cellular state. Although I cannot claim 
credit for having coined the phrase, I have written that the 
difference between transcript expression profiling and protein 
expression profiling is that " transcriptomics indicates what 
should happen, and proteomics shows what is happening. " 5 

12. For decades, such protein expression profiles 
have been generated using two dimensional polyacrylamide gel 



Rockett, "Macroresults through Microarrays," Drug Discovery Today 
7:804 - 805 (2002) (emphasis added), attached hereto as Exhibit K. 



electrophoresis (2D- PAGE) , and us d, among other things, to 
study drug effects.* 

13 . Although the protein expression profiles 
produced by 2D- PAGE analysis are analogous to the transcript 
expression profiles provided by nucleic acid microarrays, an 
even closer analogy is perhaps offered by antibody 
microarrays; as I note in my Drug Discovery Today commentary, 
such antibody microarrays date back to the work of Roger Ekins 
in the mid- to late-1980s. 7 

14. The principles in paragraph 10 also apply to 
protein expression profiling analyses, particularly to 
analyses performed using antibody microarrays. Thus, as with 
nucleic acid microarrays, the greater the number of proteins 
detectable, the greater the power of the technique; the 
absence or failure of a protein to change in expression levels 
does not diminish the usefulness of the method; and prior 
knowledge of the biological function of the protein is not 
required. As applied to protein expression profiling, these 
principles have been well understood since at least as early 
as the 1980s. 

15. Both gene and protein expression profiling are 
particularly useful to the toxicologist , especially in the 
pharmaceutical industry. Accordingly, I made the following 



* See, e.g., Anderson et al., "A two-dimensional gel database of rat 

liver proteins useful in gene regulation and drug effects studies," 
Electrophoresis 12:907 - 930 (1991), attached hereto as Exhibit L. 

7 See Ekins et al., J. Bioluminescence Chemi luminescence 5:59-78 

(1989); Ekins et al., Clin. Chem. 37: 1955-1965 (1991); and Ekins, U.S. 
Patent Nos. 5,432,099, 5,807,755, and 5,837,551, attached hereto 
respectively as Exhibits M to Q. 



statements in ray Xenobiotica revi w, written in th summer 
1998: 

[I]n the field of chemical-induced 
toxicity, it is now becoming increasingly obvious 
that most adverse reactions to drugs and chemicals 
are the result of multiple gene regulation, some of 
which are causal and some of which are casually- 
related to the toxicological phenomenon per se. 
This observation has led to an upsurge in interest 
in gene-profiling technologies which differentiate 
between the control and toxin- treated gene pools in 
target tissues and is, therefore, of value in 
rationalizing the molecular mechanisms of 
xenobiotic-induced toxicity. 

Knowledge of toxin -dependent gene 
regulation in target tissues is not solely an 
academic pursuit as much interest has been 
generated in the pharmaceutical industry to harness 
this technology in the early identification of 
toxic drug candidates, thereby shortening the 
developmental process and contributing 
substantially to the safety assessment of new 
drugs . 

For example, if the gene profile in 
response to say a testicular toxin that has been 
well-characterized in vivo could be determined in 
the testis, then this profile would be 
representative of all new drug candidates which act 
via this specific molecular mechanism of toxicity, 
thereby providing a useful and coherent approach to 
the early detection of such toxicants. 

Whereas it would be informative to know 
the identity and functionality of all genes up/down 
regulated by such toxicants, this would appear a 
longer term goal, as the majority of human genes 
have not yet been sequenced, far less their 
functionality determined. However, the current use 
of gene profiling yields a pattern of gene changes 
for a xenobiotic of unknown toxicity which may be 
matched to that of well-characterized toxins, thus 
alerting the toxicologist to possible in vivo 
similariti s between the unknown and the 
standard. . . . 



Despite the development of multiple 
technological advances which have recently brought 
the field of gene expression profiling to the 
forefront of molecular analysis, recognition of the 
importance of differential gene expression and 
characterization of differentially expressed genes 
has existed for many years. 



16. As noted in the preceding excerpt from my 
Xenobiotica review, expression profiling in toxicology studies 
yield patterns of changes that are characteristic of an agent 
of unknown toxicity, which patterns may usefully be matched to 
those of well-characterized toxins. 

17. In the context of such patterns of gene 
expression, each additional gene-specific probe provides an 
additional signal that could not otherwise have been detected, 
giving a more comprehensive, robust, higher resolution -- and 
thus more useful — pattern than otherwise would have been 
possible. 8 

18. It is my opinion, therefore, based on the state 
of the art in toxicology at least since the mid-1990s and 
as regards protein profiling, even earlier — that disclosure 
of the sequence of a new gene or protein, with or without 
knowledge of its biological function, would have been 



8 In a sense, each gene-specific probe used in such an analysis is 
analogous to a different one of the many parts of an engine, with each 
individual part, or subcombinations of such parts, deriving at least part 
of their usefulness from the utility of the completed combination, the 
functioning engine. 



sufficient information for a toxicologist to use the gene 
and/or protein in expression profiling studies in toxicology. 



19. The statements made in this declaration 



represent my individual views and are not intended to 
represent the opinion of my employer, the United States 
Environmental Protection Agency, or of any other branch of the 
federal government. Other than my current engagement to 
provide this declaration, I have neither had, nor currently 
have, financial ties to, or financial interest in, Incyte 
Corporation. I am not myself an inventor on any patent 
application claiming a gene or gene fragment. 



herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true, and 
further that these statements were made with the knowledge 
that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 
S ction 1001 of Title 18 of the United States Code and may 
jeopardize the validity of any patent application in which 
this declaration is filed or any patent that issues thereon. 



20. 



I declare further that all statements made 



John Coughlin Rockett III, Ph.D. 





Date 



.10- 



Exhibit A of Rockett Declaration 
with Response dated 1 1/18/03 
In USSN: 09/867,958 
PC-0041 CIP 



CURRICULUM VITAE 

PERSONAL DETAILS 



Name: 
Nationality: 
Work Address: 



John Coughlin Rockett IQ 
USA 

United States Environmental Protection Agency 

National Health and Environmental Effects Research Laboratory 

Reproductive Toxicology Division (MD-72) 

Gamete and Early Embryo Biology Branch 

Research Triangle Park 

NC 27711 

USA 



Work Telephone: +001 (919) 541 2678 
Work Fax: +001 (919) 541 4017 



W rk E-mail: 



rockett.iohn@epa.gov 



Employment and Higher Education 



CURRENT POSITION (12/00-present) 
Research Biologist 

Gamete and Early Embryo Biology Branch (MD-72) 
Reproductive Toxicology Division 

National Health and Environmental Effects Research Laboratory 

US Environmental Protection Agency 

Research Triangle Park 

NC 27711 

USA 



PREVIOUS POSITIONS 

8/98-12/00: NHEERL Post-Doctoral Research Fellow, Gamete and Early Embryo Biology 
Branch, Reproductive Toxicology Division, National Health and Environmental Effects 
Research Laboratory, United States Environmental Protection Agency, Research Triangle Park, 
NC, USA. 

Supervisors: Dr Sally P. Darney (Scientific publications under Sally D. Perreault) and Dr David 
J. Dix. 

5/95-7/98: Rhone-Poulenc Post-Doctoral Research Fellow, Molecular Toxicology Group, School 
of Biological Sciences, University of Surrey, Guildford, Surrey, England. 
Supervisor: Prof. G. Gordon Gibson. 



EDUCATION 

Ph.D., 1995 - University of Warwick, Coventry, W. Midlands, England 

Title: Transforming Growth Factor-P and Immune Recognition Molecules in Oesophageal 

Cancer. 

Supervisors: Dr Alan G. Morris (University of Warwick) and Dr S. Jane Darnton (Birmingham 
Heartlands Hospital) 

B.Sc. (Hons.), 1991 - University of Warwick, Coventry, W. Midlands, England. 

Degree: Microbiology and Microbial Technology (with intercalated year in industry), Class 2i. 

Tutor: Professor Howard Dalton. 



PROFESSIONAL ACTIVITIES 



Membership of Professional Societies: 

Society of Toxicology (Inc. Molecular Biology Speciality Section) (2001 -present) 
Science Advisory Board (2001 -present) 

North Carolina Chapter of the Society of Toxicology (1999-present) 

Triangle Consortium for Reproductive Biology (1999-present) 

Triangle Array Users Group ( 1 999-present) 

Institute of Biology (U.K.) (1989 - present) 

British Toxicology Society (1996 - 2000) 

Biochemical Society (U.K.) (1992-1995) 

British Society for Immunology (1 992- 1 995) 



Membership of Scientific Committees: 

International Life Sciences Institute's (ILSI) Health and Environmental Sciences Institute (HESI) 
Technical Committee on the Application of Genomics to Mechanism-Based Risk Assessment: 

• Steering Committee (5/02-present). 

• Hepatotoxicity Working Group Vice-Chair (5/02-present). 

• Hepatotoxicity Work Group Member (5/0 1 -present). 

Charter member, Fertility and Early Pregnancy Work Group of the National Children's Study 
(07/01-Present). 

National Health and Environmental Effects Research Laboratory Distinguished Lecture Series 
Committee (July 03-present). 

U.S. Environmental Protection Agency Genomics Task Force MicroaiTay Technical 
Subcommittee (August 03-present). 

National Health and Environmental Effects Research Laboratory Genomics and Proteomics 
Committee (NGPC) (September 03-present). 



Professional Meetings: 

Invited participant ("Observer") in Expert Panel Workshop: "The Role of Environmental Factors 
on the Onset and Progression of Puberty in Children". Organised by Serono Symposia 
International. November 6*-%^, 2003, Chicago, IL, USA. 

Joint organiser and co-chair of: "Genomic analysis of surrogate tissues for measuring toxic 
exposures and drug action", the "Innovations in Applied Toxicology" Symposium for the Society 
of Toxicology 42 nd Annual Meeting, March 9 th -! 3 th , 2003, Salt Lake City, UT, USA. 



(8) John C Rockett, David J. Esdaile and G Gordon Gibson (1999). Differential gene expression 
in drug metabolism: practicalities, problems and potential. Xenobiotica, 29(7):655-691. 
(7) MC Murphy, CN Brookes, JC Rockett, C Chapman, JA Lovegrove, BJ Gould, JW Wright and 
CM Williams (1999). The quantitation of lipoprotein lipase mRNA in biopsies of human adipose 
tissue, using the polymerase chain reaction, and the effect of increased consumption of n-3 
polyunsaturated fatty acids. European Journal of Clinical Nutrition, 53:441-447. 

(6) JC Rockett, DJ Esdaile and GG Gibson (1997). Molecular profiling of non-genotoxic 
carcinogenesis using differential display reverse transcription polymerase chain reaction (ddRT- 
?CR). European Journal of Drug Metabolism & Pharmacokinetics 22(4):329-33. 

(5) Rockett, J., Larkin, K., Darnton, S., Morris, A. and Matthews, H. (1997). Five newly 
established oesophageal carcinoma cell lines: phenotypic and immunological characterisation. 
British Journal of Cancer 75(2):258-263. 

(4) J C Rockett, S J Darnton, J Crocker, H R Matthews and A G Monis (1996). Lymphocyte 
infiltration in oesophageal carcinoma: lack of correlation with MHC antigens, ICAM-1, and tumour 
stage and grade. Journal of Clinical Pathology 49:264-267. 

(3) J C Rockett, S J Darnton, J Crocker, H R Matthews and A G Morris (1995). Expression of HL- 
ABC and HLA-DR histocompatability antigens and intercellular adhesion molecule- 1 in 
oesophageal carcinoma. Journal of Clinical Pathology 48:539-44. 

(2) Salam M, Rockett J and Morris A (1995). The prevalence of different human papillomavirus 
types and p53 mutations in laryngeal carcinomas: is there a reciprocal relationship? European 
Journal of Surgical Oncology 21:290-296. 

(1) Salam M, Rockett J and Morris A (1995). General primer-mediated polymerase chain reaction 
for simultaneous detection and typing of HPV in laryngeal carcinomas. Clinical Otolaryngology 
20:84-88. 



(2) Articles Submitted To A Scientific Journal 

(4) John C. Rockett, Judith E. Schmid, Christopher J. Luft, J. Brian Garges, M. Stacey Ricci, 
Pasquale Patrizio, Norman B. Hecht and David J. Dix. Gene Expression Patterns Associated with 
Infertility in Rodent and Human Models. * An invited submission* 

(3) Roger Ulrich, John C. Rockett, G. Gordon Gibson and Syril Pettit. Evaluating the Effects of 
Methapyrilene and Clofibrate on Hepatic Gene Expression: A Collaboration Between Laboratories 
and a Comparison of Platform and Analytical Approaches. 

(2) Valerie A Baker, Helen M Harries, Jeffrey F Waring, Roger Jolly, Angus de Souza, Judith E 
Schmid, Hong Ni, Roger Brown, Roger G Ulrich and John C Rockett. Clofibrate-Induced Gene 
Expression Changes in Rat Liver: A Cross-Laboratory Analysis Using Membrane cDNA Arrays. 
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(1 ) David Miller, Corrado Spadafora, David Dix, Adrian Platts, John C. Rockett, Stephen A 
Krawetz Nuclease digestion of sperm chromatin suggests a random distribution of gene sequences. 

(3) Articles In Preparation For Submission To A Scientific Journal 

(3) Spearow J, DB Tully, JC Rockett and DJ Dix. Differential testicular gene expression in mouse 
strains sensitive and resistant to endocrine disruption by estrogen. 

(2) Sally D. Perrault, John C. Rockett, Laura Fenster, James Kesner, Wendy Robbins and Steven 
Schrader. Biomarkers for Assessing Reproductive Development and Health: Part 2 - Adult 
Reproductive Health. 

(1) J. Christopher Luft, Douglas B. Tully, John C. Rockett, Judith E. Schmid and 
David J. Dix. Reproductive and genomic effects in testes from mice exposed to the water 
disinfectant byproduct bromochloroacetic acid 



(4) Book Chapters 

(4) J hn C. Rockett. Gene Microarrays Applied to Reproductive Toxicology. In Cunningham 
(Ed): Genetic and Proteomic Applications in Toxicity Testing, The Human Press, Totowa. In 
Preparation. * An invited submission * 

(3) John C. Rockett and David J Dix. Gene Expression Networks. In Cooper (ed-in-chief): 
Encyclopaedia of the Human Genome, Nature Publishing Group. London, New York. ISBN 0-333- 
80386-8 (2003). * An invited submission* 

(2) John C. Rockett. The Future of Toxicogenomics. In Michael Burczynski (ed): "An 
Introduction to Toxicogenomics". CRC Press. Boca Raton, London, New York, Washington D C 
pp299-3 1 7 (2003). * An invited submission* 

(1) J. Rockett, S. Darnton, J. Crocker, H. Matthews and A. Morris: Major Histocompatibility 
Complex (MHC) class I and JJ and Intercellular Adhesion Molecule (ICAM)-l expression in 
oesophageal carcinoma. Peracchia A, Rosati R, Bonavina L, Bona S, Chella B (eds): Recent 
Advances in Diseases of the Esophagus. Bologna: Monduzzi Editore, pp45-49 (1996). 



(5) Other Scientific Publications (Letters to Editors; Meeting Reports; Commentaries 
etc.) 

(1 1) John C. Rockett (2003). Probing the nature of microarray-based oligonucleotides. Drug 
Discovery Today 8(9):389. (A Letter To The Editor) * An invited submission* 

(10) John C. Rockett (2003). To confirm or not to confirm (microarray data) - that, is the question. 
Drug Discovery Today 8(8):343. (A Letter To The Editor) 
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(9B) Nazzareno Ballatori, James L. Boyer, and John C. Rockett. (2003), Exploiting Genome Data 
to Understand the Function, Regulation and Evolutionary Origins of Toxicologically Relevant 
Genes. Environ Health Perspect. 1 1 1(6):871-5. (A Meeting Report) 

(9A) Nazzareno Ballatori, James L. Boyer, and John C. Rockett. (2003). Exploiting Genome Data 
to Understand the Function, Regulation and Evolutionary Origins of Toxicologically Relevant 
Genes. EHP Toxicogenomics. 1 1 l(lT):61-5. (A Meeting Report) 

(8) John C. Rockett (2002). Surrogate Tissue Analysis for Monitoring the Degree and Impact of 
Exposures in Agricultural Workers. AgBiotechNet, 4:1-7 November, ABN 100. (A Review Article). 
* An invited submission* 

(7) John C. Rockett (2002). Macroresults Through Microarrays. Drug Discovery Today, 7(15);804- 
805. (A Meeting Report) 

(6) John C. Rockett (2002). Chip, chip, array! Three chips for post-genomic research. Drug 
Discovery Today, 7(8);458-459. (A Meeting Report) 

(5) John C. Rockett (2002). Use of Genomic Data in Risk Assessment. Ge/iomeBiology, 3(4): 
reports401 1.1-401 1.3 (http://genomebiologv.eom/2002/3/4/reports/401 l/?isguard=lV (A Meeting 
Report) 

(4) John C. Rockett (2001). Genomic and Proteomic Techniques Applied to Reproductive 
Biology. GenomeBiology 2(9): 4020.1-4020.3 (http://genomebiologv.eom/2001/2/9/reports/4020/) . 
(A Meeting Report) 

(3) John C. Rockett (2001). Chipping away at the mystery of drug responses. The 
Pharmacogenomics Journal, 1(3); 161 -163. (A commentary) * An invited submission* 

(2) Rockett, John C. and Dix, David J. (1999). U.S. EPA workshop: Application of DNA arrays to 
Toxicology. Environmental Health Perspectives, 107(8):68 1-685. (A Meeting Report) 

(1) John C. Rockett III (1995). Immune recognition molecules and transforming growth factor 
beta-1 in oesophageal cancer. Ph.D. thesis, University of Warwick, Coventry, England.CPA.D. 
thesis) 



(6) Published Book, Paper and Website reviews 

(9) John C. Rockett (2002). A report on the manuscript: Systemic RNAi in C. elegans requires the 
putative transmembrane protein SUM. Winston WM, Molodowitch C, Hunter CP. Science. 2002 
295:2456-2459. Ge/io/weBiology, 3(7):reports0034 
http://genomebiologv.eom/2002/3/7/reports/0034/ 



(8) John C Rockett (2001). A report on the manuscript: Genetic rescue of an endangered mammal 
by cross-species nuclear transfer using post-mortem somatic cells. P Loi ,.et al., Nat BiotechnoL 
2001, 19:962-964. GewomeBiology, 3(l):reports0006. 
fhttp://genomebiologv.com/2001/3/l/reports/0006/) . 

(7) John C Rockett (2001). A report on the manuscript: Molecular Classification of Human 
Carcinomas by Use of Gene Expression Signatures. A Su et al., Cancer Res. 2001 61 :7388-7393. 
GenomeBiology, 3(l):reports0005. (http://genomebiologv.eom/2001/3/l/reports/0005/) . 

(6) John C. Rockett (2001). A report on the manuscript: Genetic evidence for two species of 
elephant in Africa A Roca et al, Science. 2001 Aug 24;293(5534): 1473-7. GenomeBiology, 
2(12):reports0045. 0ittp://www.genomebiology.com/2001 /2/1 2/reports/0045/ . 

(5) John C. Rockett (2001). A report on the manuscript: Extensive genetic polymorphism in the 
human CYP2B6 gene with impact on expression and function in human liver. T Lang et al., 
Pharmacogenetics, 2001, 11(5):399-415. GenomeBio\ogy, 2(12):reports0044. 
flittp://www.genomebiology.com/2001/2/12/reports/0044/) . 

(4) John C. Rockett (2001). A report on the manuscript: Novel Human Testis-Specific cDNA: 
molecular Cloning, Expression and Immunological Effects of the Recombinant Protein. R 
Santhanam and R K Naz, Molecular Reproduction and Development 60:1-12 (2001). 
GenomeBiology, 2(ll):reports0040. (http://genomebiology.eom/2001/2/ll/reports/0040/) . 

(3) John C. Rockett (2001). A report on the website: BIND - The Biomolecular Interaction 
Network Database (http://www.bind.ca/) . GenomeBiology, 2(9): reports2011. 
http://www.genomebiology.eom/2001/2/9/reports/2011/ . 

(2) John C Rockett (2001). A report on the manuscript: Exploring the DNA-binding specificities 
of zinc fingers with DNA microarrays. ML Bulyk et al., Proc Natl Acad Sci USA 2001, 98:7158- 
7163. Ge/io/weBiology, 2(10): reports0032. fhttp://genomebiology.com/2001/2/10/reports/0032/) . 

(1) J Rockett (1996). A Book Review on: "Cell Adhesion and Cancer" (Eds., Hogg N. and Hart I.). 
Clinical Molecular Pathology 49(1):M64. * An invited submission* 



(7) Published Abstracts of Poster and Oral Presentations 

(17) Amber K. Goetz, Wenjun Bao, Judith E. Schmid, Carmen Wood, Hongzu Ren, Deborah S. 
Best, Rachel N. Murrell, John C. Rockett, Michael G. Narotsky, Douglas C Wolf, Douglas B. 
Tully, David J. Dix Gene Expression Profiling in Testis and Liver of Mice to Identify Modes of 
Action of Conazole Toxicities. Society of Toxicology 43 rd Annual Meeting, March 21 st -25 th , 2004, 
Baltimore, MD, USA. Toxicological Sciences. (Submitted) 

(16) Jane Gallagher, Theresa Lehman, Ramakrishna Modali, Scott Rhoney, Marien Clas, Jeff 
Inmon, John C. Rockett, David Dix, Cindy Mamay, Suzanne Fenton, Suzanne McMaster, Stan 
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Barone Jr, Pauline Mendola and Reeder Sams. Validation of Non-Invasive Biological Samples: 
Pilot Projects Relevant to the National Children Study. Society of Toxicology 43 Annual Meeting, 
March 21 st -25 lh , 2004, Baltimore, MD, USA. Toxicological Sciences. (Submitted) 

(15) B.S. Pukazhenthi, J. C. Rockett, M. Ouyang, DJ. Dix, J.G. Howard, P. Georgopoulos, WJ. J. 
Welsh and D. E. Wildt. Gene Expression In The Testis Of Normospermic Versus Teratospermic 
Domestic Cats Using Human cDNA Microarray Analyses. Society for the Study of Reproduction 
36 th Annual Meeting, July 19 th -22 nd , 2003, Cincinnati, OH, USA. Biology of Reproduction 68 (Supp 
1):191. 

(14) David J. Dix and John C. Rockett (2003). Genomic and Proteomic Analysis of Surrogate 
Tissues for Assessing Toxic Exposures and Disease States. Innovation in Applied Toxicology 
symposium entitled "Genomic and Proteomic Analysis of Surrogate Tissues for Assessing Toxic 
Exposures and Disease States". Society of Toxicology 42 nd Annual Meeting, March 9 th -13 lh , 2003, 
Salt Lake City, UT, USA. Toxicological Sciences 72(S-1):276. 

(13) John C. Rockett, Chad R. Blystone, Amber K. Goetz, Rachel N. Muirell, Judith E. Schmid 
and David J. Dix. (2003). Gene Expression Profiling Of Accessible Surrogate Tissues To Monitor 
Molecular Changes In Inaccessible Target Tissues Following Toxicant Exposure. Innovations in 
Applied Toxicology Symposium entitled "Genomic and Proteomic Analysis of Surrogate Tissues 
for Assessing Toxic Exposures and Disease States". Society of Toxicology 42 nd Annual Meeting, 
March 9*-13*, 2003, Salt Lake City, UT, USA. Toxicological Sciences 72(S-1):276. 

(12) Douglas B. Tully, J. Christopher Luft, John C Rockett, Judy E. Schmid and David J. Dix 
(2002). Effects on gene expression in testes from adult male mice exposed to the water disinfectant 
byproduct bromochloroacetic acid. Society for the Study of Reproduction 35 th Annual Meeting, July 
28-31 f 2002, Baltimore, Maryland, USA. Biology of Reproduction 66 (Supp 1):223. 

(11) David J. Dix, Kary E. Thompson, John C Rockett, Judith E. Schmid, Robert J. Goodrich, 
David Miller, G. Charles Ostermeier and Stephen A. Krawetz (2002). Testis and spermatazola RNA 
profiles of normal fertile men. Society for the Study of Reproduction 35 th Annual Meeting, July 28- 
31, 2002, Baltimore, Maryland, USA. Biology of Reproduction 66 (Supp 1):194. 

(10) Asa J. Oudes, John C. Rockett, David J. Dix and Kwan Hee Kim (2002). Identification of 
retinoic acid induced genes in mouse testis by cDNA microarray analysis. 27 th Annual Meeting of 
the American Society of Andrology, 4/24-27/02. J. Andrology Supplement March/ April. 

(9) John C Rockett, Robert J. Kavlock, Christy Lambright, Louise G. Parks, Judith E. Schmid, 
Vickie S. Wilson and David J. Dix (2002). Use of DNA arrays to monitor gene expression in blood 
and uterus from Long-Evans rats following 17-0-estradiol exposure - a new approach to 
biomonitoring endocrine disrupting chemicals using surrogate tissues. Toxicological Sciences 66(1): 
Abstract No. 1388. 

(8) David J. Dix and John C. Rockett (2002). Genomic analysis of the testicular toxicity of 
haloacetic acids. Platform presentation at the symposium, "Defining the cellular and molecular 



ii 



mechanisms of toxicant action in the testis". Toxicological Science 66 (1): Abstract No.848. 

(7) JC Rockett, JC Luft, JB Garges and DJ Dix (2001). The reproductive effects of the water 
disinfectant byproduct bromochloroacetate on juvenile and adult male mice. Toxicological 
Sciences, 60 (1):250. 

(6) Tarka DK, Klinefelter GR, Rockett JC, Suarez JD, Roberts NL and Rogers JM (2001). Effect 
of gestational expsore to ethane dimethane sulfonate (EDS), bromochloroacetic acid (BCA) and 
molinate on reproductive function in CD-I male mice. Toxicological Sciences, 60 (1):250. 

(5) Garges JB, Rockett JC and Dix DJ (2001). Developmental and reproductive phenotype of mice 
lacking stress-inducible 70 kDa heat shock proteins (Hsp70s). Toxicological Sciences, 60 (1):383. 
(4) D Dix, J Rockett, J Luft, J Garges, M Ricci, P Patrizio and N Hecht (2000). Using DNA 
microarrays to characterise gene expression in testes of fertile and infertile humans and mice. 
Biology of Reproduction, 62 (sl);227. 

(3) J Luft, J B Garges, J Rockett and D Dix (2000). Male reproductive toxicity of 
bromochloroacetic acid in mice. Biology of Reproduction, 62 (sl);246. 

(2) Rockett, JC, Garges, JB and Dix, DJ (2000). A single heat-shock of juvenile male mice causes 
a long-term decrease in fertility and reduces embryo quality. Toxicological Sciences 54 (1):365. 

(1) JC Rockett, SJ Damton, J Crocker, HR Matthews and AG Morris (1994). Major 
Histocompatability (MHC) class I and JJ and intercellular adhesion molecule (ICAM)-l expression 
in oesophageal carcinoma (OC). Immunology 83 (sl):64. 



(8) Invited Oral Presentations 

(10) John C. Rockett and Gary M Hellmann. To confirm or not to confirm (microarray data) - 
that is the question. Seminar for EPA/NHEERL Genomics and Proteomics Committee's ArrayQA 
forum, August 25 th , 2003, RTF, NC, USA. 

(9) John C. Rockett. "Biomonitoring Toxicant Exposure and Effect Using Toxicogenomics and 
Surrogate Tissue Analysis". Seminar for Division of Epidemiology, Statistics and Prevention 
Research, National Institute of Child Health and Development, May 29 th , 2003, Rockville, MD 
USA. 

(8) John C. Rockett. "Genomics and Proetomics: New Toxicity Testing". Platform presentation at 
US EPA Regional Risk Assessors Annual Conference, April 28 th - May 2 nd , 2003, Stone Mountain 
GA, USA. 

(7) John C. Rockett, Chad R. Blystone, Amber K. Goetz, Rachel N. Murrell, Judith E. Schmid and 
David J. Dix. "Gene Expression Profiling Of Accessible Surrogate Tissues To Monitor Molecular 
Changes in Inaccessible Target Tissues Following Toxicant Exposure." Platform presentation at 
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SoT 42 Annual Meeting symposium entitled "Genomic and Proteomic Analysis of Surrogate 
Tissues for Measuring Toxic Exposures and Drug Action", March 9 th - 13 th * 2003, Salt Lake City, 
UT,USA. 

(6) John C. Rockett. "A Toxicogenomic Approach to Surrogate Tissue Analysis". Seminar for 
Department of Environmental and Molecular Toxicology, North Carolina State University, 
September 3 rd , 2002, Raleigh, NC, USA. 

(5) John C. Rockett. "Differential gene expression in toxicology: practicalities, problems and 
potential". Platform presentation at 9 th Annual Mount Desert Island Biological Laboratory 
Environmental Health Sciences Symposium: Exploiting Genome Data to Understand the Function, 
Regulation and Evolutionary Origins of Toxicologically Relevant Genes, July 10^-1 1 th , 2002, 
Salisbury Cove, Maine, USA. 

(4) John C. Rockett, Leroy Folmar, Michael J. Hemmer and David J. Dix. "Arrays for 
biomonitoring environmental and reproductive toxicology". Platform Presentation at Macroresults 
Through Microarrays 3 - Advancing Drug Development, April 29 th -May 1 st , 2002, Boston, MA, 
USA. 

(3) John C. Rockett, Sigmund Degitz, Suzanne E. Fenton, Leroy Folmar, Michael J. Hemmer, Joe 
E Tietge, and David J. Dix. "Use of DNA Arrays in Environmental Toxicology". Platform 
presentation at the 4 th Annual Lab-on-a-Chip and Microarrays for Post-Genomic Applications 
meeting, January 14 th - 16 th , 2002, Zurich, Switzerland. 

(2) John C. Rockett. "DNA Arrays". Seminar at EPA Molecular Biology Course, April 8 th , 1999, 
USEPA, RTP, NC, USA. 

(1) John C. Rockett "Contract Services for Array Applications". Seminar at the Triangle Array 
Users Group, May 1 st , 1999, CUT, RTP, NC, USA. 



(9) Other Poster and Oral Presentations 

(23) John C. Rockett, Wenjun Bao, Chad R. Blystone, Amber K. Goetz, Rachel N. Murrell, 
Hongzu Ren, Judith E. Schmid, Jessica Stapelfeldt, Lillian F. Strader, Kary E. Thompson and David 
J. Dix. Genomic Analysis of Surrogate Tissues for Assessing Environmental Exposures and Future 
Disease States. ILSI-HESI meeting: Toxicogenomics in Risk Assessment - Assessing the Utility, 
Challenges, and Next Steps. June S*-^ 1 , 2003, Fairfax, VA, USA. 

(22) John C. Rockett, Wenjun Bao, Chad R. Blystone, Amber K. Goetz , Rachel N. Murrell, 
Hongzu Ren, Judith E. Schmid, Jessica Stapelfeldt, Lillian F. Strader, Kary E. Thompson and David 
J. Dix. Genomic Analysis of Surrogate Tissues for Assessing Environmental Exposures and Future 
Disease States. EPA Science Forum, May 5 th -7 lh , 2003, Washington, D.C., USA. 
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(21) Gennaine Buck, Courtney Johnson, Joseph Stanford, Anne Sweeney, Laura Schieve, John 
Rockett, Sherry Selevan and Steve Schrader. Prospective Pregnancy Study Designs for Assessing 
Reproductive and Developmental Toxicants. American Epidemiology Society Meeting, March 27 - 
28™ 2003, Atlanta, GA, USA. 

(20) John C. Rockett, Chad R. Blystone, Amber K. Goetz, Rachel N. Murrell, Hongzu Ren, Judith 
E. Schmid, Jessica Stapelfeldt, Lillian F. Strader, Kary E. Thompson, Doug B. Tully, Paul Zigas 
and David J. Dix. Genomic Analysis of Surrogate Tissues for Assessing Environmental Exposures 
and Future Disease States. National Children 's Study Assembly Meeting, December 1 6 th - 1 8 th , 2002, 
Baltimore, MD, USA. 

(19) John Rockett. The Use of Gene Expression Profiling to Detect Early Biomarkers of Adverse 
Effects Prior to Clinical manifestation. National Children 's Study: Meeting of EPA Project Leaders 
- Methods Development Projects. November 20 th , 2002, USEPA, RTP, NC, USA. (Oral 
Presentation) 

(18) GC Ostermeier, RJ Goodrich, K Thompson, J Rockett, MP Diamond, K Collins, NICHD 
Reproductive Medicine Network, DJ. Dix, D Miller and SA Krawetz. Defining the spermatozoal 
RNA population in normal fertile men. American Society of Reproductive Medicine October 12-17, 
2002, Seattle, WA, USA. 

(17) G. Charles Ostermeier, Robert J. Goodrich, Kary Thompson, John Rockett, Michael P. 
Diamond, Karen Collins, NICHD Reproductive Medicine Network, David J. Dix, David Miller and 
Stephen A. Krawetz. RNAs isolated from ejaculate spermatozoa provide a noninvasive means to 
investigate testicular gene expression. Gordon Conference on Mammalian Gametogenesis & 
Embryogenesis, June 30 th - July 5 th , Connecticut College, New London, CT, USA. 

(16) David Dix, John Rockett, Judith Schmid, Lillian Strader, Douglas Tully. Genomic analysis of 
testicular toxicity. USEPA/NHEERL/RTD Peer Review, October 22 nd , 2001 , RTP, NC, USA. 

(15) David Dix, John Rockett, Judith Schmid, Douglas Tully. Monitoring human reproductive 
health and development through gene expression profiling. USEPA/NHEERL/RTD Peer Review, 
October 22 nd , 2001 , RTP, NC, USA. 

(14) Patrizio P, N Hecht, J Rockett, J Schmid and D Dix (2001). DNA microarrays to study gene 
expression profiles in testis of fertile and infertile men. 57th Annual Meeting of the American 
Society for Reproductive Medicine, October 20 th -25 lh , 2001, Orlando, FL, USA. 

(13) Jimmy L. Spearow, Dale Morris, Uland Wong, Rashid Altafi, Saeed Eteiwi, Mark Stanford, 
Trevor Steams, Lorena Orozio, Angela Chen, John Rockett, Douglas Tully, David Dix and 
Marylynn Barkley. Genetic Variation In Susceptibility To The Disruption Of Testicular 
Development And Gene Expression By Pubertal Exposure To Estrogenic Agents. Third Annual 
University of California at Davis Conference for Environmental Health Scientists, Disruption of 
Developing Systems and Advances in Therapeutic Approaches August 27 th , 2001, UC Davis, CA, 
USA. 
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(12) Tarka DK, Klinefelter GR, Rockett JC, Suarez JD, Roberts NL and Rogers JM (2001). Effect 
of gestational expsore to ethane dimethane sulfonate (EDS), bromochloroacetic acid (BCA) and 
molinate on reproductive function in CD-I male mice. North Carolina Society of Toxicology Winter 
Meeting, March 3 rd , 2001. NIEHS, RTP, NC, USA. 

(1 1) David Dix, John Rockett, Leroy Folmar, Michael Hemmer, Sigmund Degitz, and Joseph 
Tietge (2001). Biomonitoring the Toxicogenomic Response to Endocrine Disrupting Chemicals in 
Humans, Laboratory Species and Wildlife. U.S. - Japan International Workshop for Endocrine 
Disrupting Chemicals, February 28 th -March 3 rd , 2001, Tsukuba, Japan. 

(10) John C. Rockett, Faye L. Mapp, J. Brian Garges, J. Christopher Luft, Chisato Mori and David 
J Dix David Dix (2001). The effects of hyperthermia on spermatogenesis, apoptosis, gene 
expression and fertility in adult male mice. Triangle Consortium for Reproductive Biology Annual 
Meeting, January 27 th , 2001, RTP, NC, USA. 

(9) Gangolli E, Dix DJ, Garges J B, Rockett, JC and Idzerda RL (2000). Testosterone Regulation 
of Sertoli Cell genes. 11 th International Congress of Endocrinology, October 29 th -November 2 nd , 
2000, Sydney, Australia. 

(8) J Rockett, J Luft, J Garges, M Ricci, P Patrizio, N Hecht and D Dix (2000). Using DNA 
microarrays to characterise gene expression in testes of fertile and infertile humans and mice. 
Functional Genomics & Microarray Data Mining, August 3 rd «4th th 2000, Durham, NC, USA. 

(7) Rockett JC, S Ricci, P Patrizio, NB Hecht, JB Garges and DJ Dix (2000). Gene Expression in 
the Mammalian Testis. 5 th NHEERL Symposium, June 6^-8*, 2000, RTP, NC, USA. 

(6) J Luft, J B Garges, J Rockett and D Dix (2000). Male reproductive toxicity of 
bromochloroacetic acid in mice. 2000 NIEHS/NTA Biomedical Science and Career Fair, April 28 th 
2000, RTP, NC, USA. 

(5) Rockett JC, S Ricci, P Patrizio, NB Hecht, JB Garges and DJ Dix (2000). Gene Expression in 
the Mammalian Testis. Molecular Toxicology, Toxicogenomics and Associated Bioinformatics 
Applied to Drug Discovery meeting, January 1 1 lh - 1 5 th , 2000,Santa Fe,NM, USA. 

(4) JC Rockett and DJ Dix (1999). Development of DNA arrays for the analysis of testis-expressed 
genes in humans and mice. The 8th Annual National Health and Environmental Effects Research 
Laboratory Open House. November 2 nd -3 rd , 1999, RTP, NC, USA. 

(3) JC Rockett, DJ Esdaile and GG Gibson (1997). Molecular profiling of non-genotoxic 
carcinogenesis using differential display reverse transcription polymerase chain reaction (ddRT- 
PCR). The British Toxicology Society Annual Meeting, April 19 th -22 nd , 1998, University of Surrey, 
Guildford, Surrey, England. 

(2) JC Rockett, DJ Esdaile and GG Gibson (1997). Molecular profiling of non-genotoxic 
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carcinogenesis using differential display reverse transcription polymerase chain reaction (ddRT- 
PCR). Poster presentation at Symposium on Drug Metabolism: Towards the next Millennium. 
August 26 !h -28 t \l997, London King's College, London, England. 

(1) J Rockett, S Darnton, J Crocker, H Matthews and A Morris: Major Histocompatibility Complex 
(MHC) class I and II and Intercellular Adhesion Molecule (ICAM)-l expression in oesophageal 
carcinoma. Oral presentation at The 6th World Congress of the International Society for Diseases of 
the Esophagus, August 23 rd -26 th , 1995, Milan, Italy. 
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Exhibit B of Rockett Declaration 
with Response dated 1 1/18/03 
In USSN: 09/867,958 

pc-0041 cn> 



27. 



31 



Arfip sequence tolowftg Ser 400 and occurs wttti 
the dorneh of Atfip that shows howttbgy wlh hOE 
(14). To deteta the complete STE23 sequence and 
oraatttrwctteSA^ACrruaton. potymeraGe chart 
reaction (PCR) prim en fc'-TCGGAAGACCTCAT- 
IU I bU I CATTTTGATATTGCTG- TGTAGATTG- 
TACTGAGAGTGCAC-3'; and S'-GCTACMAC^GC- 
GTCGACTTGMTGCCCCGACATCTTCGACTGT- 
GCGGTATT7CACAOCG-3') were uaad to ampffy 
the URA3 sequence of pRS3l6. and the reaction 
product w«s transformed Into yeast tor one-step gene 
raptacaiioi a P- Roffwten. Methods Enzymu 104, 
281 (1991]). To create the cxtf tr.iJEU2 nutation corv 
tainad on p1 14, a SiW* Sat I fragment from pAW. 1 
was cloned Into ptCiS, and an interna 4A*b Hpa 
l-^Xho I fragment was replaced vtftti a LfiLC fragment. 
To construct the ttnMA-i BJ2 aJete (a detetibn cor- 
ra sp onolnp to 931 arnho acids) earned on pi 53. a 
LEU? fragment was used to replace the 2S-kb Pml 
I-Ecf136 1 fragment of S7E23, which occurs with* a 
6jM6 Hnd K-6o1 B genomic fragment carried on 
pSP72 (Piomege). To create YEpAffar, a 1.6-Kb 
Bam HI fragment contdieiu MFA1. from pKK 16 pc 
Kucrsar. R £ Sterna. J. Tnorner. BMBO d 8. 3973 
0989}], was igaled into the Bam HI sleofYEp35l p. 
E. Hi, A. M. Myers. T. J. Koemer, A. Izagoktl, Yeast 
2,163(1966]). 

24. J. Chant and I Herekowitz. CeD 65, 1203(1991). 

25. B. W. Matthews, Act Chan Ass. 21. 333 (1986). 

26. K. KuchJar, H. G. Dohtman, J. Tnorner. J. CelBioL 
120, 1203 (1993); a Kofng and C P. Hollenberg, 
EUBOJ. 13, 3261 (1994); C. Bertcower, O. Loeyza. 
S. Michaels. A** flcV. Car fi, 1 185 (1994). 
A. Bander and J. R. Pringte. Proc. NatL Acad. Set. 
USA 86. 9976 (1989); J. Chant. K. Corrado. J. R 
Pnngte, I. Herskowrtz. CeB 65, 1213 (1991); S. 
Powers, E. Gonzales, T. Christensen, J. Cuban. D. 
Broek. bid., p. i225 ; H. O. Park. J. Chant. I. Her- 
akowttz. Nature 365. 269 (1993); J. Chant. Trends 

Genet 10. 328 (1994); and J. R Pringie. J 

CeMBiot. 129. 751 (1995); J. Chant M. Mschka. E. 
Mitchell. L Herskowrtz. J. R Pringie. t*J„ p. 767. 
G. F. Sprague Jr., Methods. Enzyme* 194, 77 
(1991). 

29. Si n ge) t ot ter abbrevi at ions for the arnho acid resi- 
dues are as latows: A, Ala; C. Cys; O, Asp; E, Gkr, F 
Phe; G. Gly: H. His; I. He; K, Lys; L. Leu; M, Met; N,' 
Asa P. Pro; Q, Gh; R Am; S. Sen T.Thr; V, VaJ: W. 
Trp;andY.Tyr. 

30. A W303 1A derivative, SY2625 (MATa ura3-J fau?<3. 
tl20pi-1adB2-icani-i00ssti&mto2teRBnbcZ 

search. SY262S os^Inbs tor trie matrtg assays, se- 
creted pheromone assays, and the pulse-chase exper- 
iments hduoed tie totowfrg stars: V49 (jte22-j), 
Y115 (m*7A^atg. Y142 (■*f7.vfjR43&, Y173 
fctttf ALLELE). Y220 s«B23A=LRA3). Y221 

tsta23a.vCWA3>, Y231 to4W.lBJ2 ste23o^L£U2). 
and Y233 &B23teLBJ%. MATo deriwaUwjs of 
SY2625 hduoad the foBowhg strains: Yigg 
(SY2625 made M4ra). Y278 (sfe22-f), Y195 
(mrala^Lae). Y196 ip»tlt::LEU2L and Y197 
(atf I.vLffW3). The EG 123 (U4Ta tatf ura3 trof can/ 
/>'s4) genetic backgnxrxJ was used to create a set of 
strains tor analysis of bud stta wofrctrm. EQ123 de- 
rivatives induded the totowhg strains: Y175 
ifixtlCL-LBJQ, Y223 \fix!1-jJRA3l. Y234 (s(a23A^ 
LBJZL and Y272 (a*ffA.*.l£LP sfa23A:.t£U2). 
MATo derivatives of EG123 routed the foftowing 
strains: Y214 (EG123 made AMTa) and Y293 
iflx?1&::LBJ2l AO strains ware generated by means 
of standard genetic or mdecUar melhoo* fr*oMng 
the appropriate constructs (23). h particular, the axff 
*te23 double mutant strains ware creeled by cross- 
hg of the appropriate MAT* steZJ and M4Ta exti 
mutants, followed by sporUatton of the resuttarrtcSp- 
bid end isolation of the doubts mutant from nonpe- 
rental cf-type tetrads. Gene disruptions were con- 
firmed with either PCR or Southern (DMA) analysis. 
P129 is a YEp352 (1 E. Hi, A. M Myers, T. J. Ko- 
emer. A. Tzagoloff, Yeasf 2, 1 63 (1 9363) pkairvi con- 
taring a 5.5-kb Sat j tagmant of pAXl 1. pl5l was 
derived from pi29 by irsertion of a Wear attheBgfl 
site withh AXL /, which tad to an in-frame insartbnof 
the hamagpUhin (HA) epftope ffXTrPttTVPOYA) (29) 
between arnho ados 654 and 855 oJ the AXL 1 prod- 



a Bam H^Sst I fragment fcunpAXI? Subs? 
tutjonrruatjensoftherjropo^ 

"^fggw ta hvoMng appropriate synthetic oegenu- 
S'<nGCTCACAAAGCGCT- 
GCCAAACCGGC-3'; wd1-£7lA, S'-AAGAATCAT- 
GTGCGCACAAAGGTGCGW; and axf7^77D 5». 
AAGAATCATGTGATCACAAAGGTC0GC-31- The 
muttbons were oonftinyd by sequence analysis. Af- 
ta*" mutagenesis, the 0.4-kb Bam HHWsc I fragment 
from the rruagenized pC225 piasrrids was trans- 
ferred no pAXL 7 to aeate a set of pRS31 6 ctisrrsds 
C *2*? J*??* AXL7 P124 frd1-H6&A). 

P130 (wri ^7iA). and pi 32 W-E71D>. Smlarty, a 
wjtof^KMagg^aleiasc-^ 
atedaAer reptonient of the pi5l Bam H»-Msc I 
fracroit. to oenerate p161 fcutf-£77A), p16 2 (anfT- 
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Quantitative Monitoring of Gene Expression 
Patterns with a Complementary DNA Microarray 

Mark Schena,* Dari ShaJon,*t Ronald W. Davis 
Patrick O. Brown* 

t?if 0 ? k f aCity system was developed to monitor the expression of many oenes in 



28. 



The Temporal, developmental, topographi- 
cal histological, and physiological patterns 
in which a gene is expressed provide clues to 
its biological role. The large and expanding 
database of complementary DNA (cDNA) 
sequences from many organisms (/) presents 
the opportunity of defining these patterns at 
the level of the whole genome. 

For these studies, we used the small flow, 
ering plant Arabidopsis thaliana as a model 
organism. Arabidopsis possesses many ad- 
vantages for gene expression analysis, in- 
cluding the fact that it has the smallest 
genome of any* higher eukaryote examined 
to date (2). Forty-five cloned Arooidopjts 
cDNAi (Table I), including 14 complete 
sequences and 31 expressed sequence tags 
(ESTs), were used as gene-specific targets. 
We obtained the ESTs by selecting cDNA 
clones at random from an Arabidopsis 
cDNA library. Sequence analysis revealed 
that 28 of the 31 ESTs matched sequences 
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in the database (Table 1). Three additional 
cDNAs from other organisms served as con- 
trols in the experiments. 

The 48 cDNAs, averaging -1.0 kb, 
were amplified with the polymerase chain 
reaction (PCR) and deposited into indi- 
vidual wells of a 96-well microliter plat*. 
Each sample was duplicated in two adja- 
cent wells to allow the reproducibility of 
the arraying and hybridization process to 
be tested. Samples from the microliter 
plate were printed onto glass microscope 
slides in an area measuring 3.5 mm by 5.5 
mm with the use of a high-speed arraying 
machine (3). The arrays were processed by 
chemical and heat treatment to attach the 
DNA sequences to the glass surface and 
denature them (3). Three arrays, printed 
in a single lot, were used for the experi- 

d/^S" hcrC * A SingIc microtit « plate of 
PCR products provides sufficient material 
to print at least 500 arrays. 

Fluorescent probes were prepared from 
total Araiwiopju mRNA (4) by a single 
round of reverse transcription (5). The Arc- 
bidopsis mRNA was supplemented with hu- 
man acetylcholine receptor (AChR) mRNA 
at a dilution of 1 : 10,000 (w/w) before cDNA 
synthesis, to provide an internal standard for 
calibration (5). The resulting fluorescently 
labeled cDNA mixture was hybridized to an 
array at high stringency (6) and scanned 

467 



with a laser (3). A high-sensitivity scan gave 
signals that saturated the detector at nearly 
all of the Arabidopsis target sites (Fig. 1A). 
Calibration relative to the AChR mRNA 
standard (Fig. 1A) established a sensitivity 
limit of - 1 :50,000. No detectable hybridba- 
tion was observed to either the rat glucocor- 
ticoid receptor (fig. 1A) or the yeast TRP4 
(Fig. 1A) targets even at the highest scan- 
ning sensitivity. A moderate-sensitivity scan 



of the same array allowed linear detection of 
the more abundant transcripts (Fig. IB). 
Quantitation of both scans revealed a range 
of expression levels spanning three orders of 
magnitude for the 45 genes tested (Table 2). 
RNA blots (7) for several genes (fig. 2) 
corroborated the expression levels measured 
with the microarray to within a factor of 5 
(Table 2). 

Differential gene expression was investi- 
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gated with a simultaneous, two-color hy- 
bridization scheme, which served to mini- 
mixe experimental variation inherent in the 
comparison of independent hybridizations, 
Huorescent probes were prepared from two 
mRNA sources with the use of reverse tran- 
scriptase in the presence of fluorescein- and 
lissamine-labcled nucleotide analogs, re- 
spectively (5). The two probes were then 
mixed together in equal proportions, hy- . 
bridized to a single array, and scanned sep- 
arately for fluorescein and lissamine emis- 
sion after independent excitation of the two 
fluorophores (3). 

To test whether overexpression of a sin- 
gle gene could be detected in a pool of total 
Arabidopsis mRNA, we used a microarray to 
analyze a transgenic line overexpressing the 
single transcription factor HAT4 (8). Fluo- 
rescent probes representing mRNA from 
wild-type and HAT^-transgenic plants were 
labeled with fluorescein and lissamine, re- 
spectively; the two probes were then mixed 
and hybridized to a single array. An intense 
hybridization signal was observed at the 
position of the HAT* cDNA in the lissa- 
mine-specific scan (Fig. ID), but not in the 
fluorescein-specific scan of the same array 
(Fig. 1C). Calibration with AChR mRNA 
added to the fluorescein and lissamine 
cDNA synthesis reactions at dilutions of 
1:10,000 (Fig. 1C) and 1:100 (Fig. ID), 
respectively, revealed a 50-fold elevation of 
HAT4 mRNA in the transgenic line rela- 
tive to its abundance in wild-type plants 
(Table 2). This magnitude of HAT4 over- 
expression matched that inferred from the 
Northern (RNA) analysis within a factor of 
2 (Fig. 2 and Table 2). Expression of all the 
other genes monitored on the array differed 
by less than a factor of 5 between HAT4- 
transgenic and wild-type plants (Fig 1, C 
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H4T4bansywric 



CABI 



HAT4 



ROC1 




1.0 0.1 0.01 1,0 ai 0.01 

mRNA (mq) 




20 2J0 0.2 
mRNA (rig) 

Fig. 2. Gene expression monitored with RNA 
(Northern) blot analysis. Designated amounts of 
mRNA from wild-type and H474-transgenc 
plants were spotted onto nylon membranes and 
probed with the cONAs indicated. Purified human 
AChR mRNA was used for calibration. ■ 



and D, and Tabic 2). Hybridization of flu. 
orescein-labeled glucocorticoid receptor 
cDNA (Fig. 1C) and lissamine-labeled 
TRP4 cDNA (Fig. ID) verified the pres. 
ence of the negative control targets and the 
lack of optical cross talk between the two 
fluorophores. 

To explore a more complex alteration in 
expression patterns, we performed a second 
two-color hybridization experiment with 
fluorescein, and lissamine-labeled probes 
prepared from root and leaf mRNA, respec- 
tively. Tht scanning sensitivities for the 
two fluorophores were normalized by 
matching die signals resulting from AChR 



mRNA, which was added to both cDNA 
synthesis reactions at a dilution of 1:1000 
(Fig. 1, E and F). A comparison of the scans 
revealed widespread differences in gene ex. 
pression between root and leaf tissue (Fig. 1, 
E and F). The mRNA from the light-regu- 
lated CABI gene was ^500-fold more abun- 
dant in leaf (Fig. IF) than in root tissue 
(Fig. IE). The expression of 26 other genes 
differed between root and leaf tissue by 
more than a factor of 5 (Fig. 1, E and F). 

The HAT4-transgenic line we examined 
has elongated hypocotyls, early flowering, 
poor germination, and altered pigmentation 
W. Although changes in expression were 



Table 1. Sequences contained on the cONA microarray Shown is the rw^ 

function, and the accession number of each cnhiT^^ZZ^ J* position, the known or putative 
in this study rr*tchedTs£^ 

****** ATFase, ade^sne tnphos^^S, £^ Wt ™ ide ***** 

Position cDNA 



Function 



a1.2 
a3.4 
a5, 6 
a7.8 
a9, 10 
a11. 12 
D1.2 
D3.4 
b5.6 
b7.8 
09. 10 
011, 12 
C1.2 
C3. 4 
C5, 6 
c7,8 
cfi. 10 
c11. 12 

d1.2 

d3.4 

d5,6 

d7,8 

d9, 10 

011. 12 

61,2 

e3,4 

e5.6 

67,8 

e9, 10 

en, 12 

II, 2 
13.4 
15, 6 
f7,8 
19.10 

III. 12 

91.2 

93.4 

95.6 

97.8 
99.10 
911.12 
hi. 2 
h3,4 
h5.6 
h7,8 
h9. 10 
Ml. 12 



Accession 
number 



AChR 

EST3 

EST6 

AAC1 

EST12 

EST13 

CABI 

EST17 

GA4 

EST19 

GBF-1 

EST23 

EST29 

GBf-2 

EST34 

EST35 

EST41 

rGR 

EST42 

EST45 

HATl 

EST46 

EST49 

HATS 

HAT4 

EST50 

HATS 

EST51 

HA 722 

EST52 

EST59 

KNAT1 

EBT60 

EST69 

PPH1 

EST70 

EST 75 

EST 78 

ROC1 

EST82 

ESTB3 

EST84 

ESTB1 

EST96 

SARI 

EST100 

EST 103 

TRP4 



Human AChR 
Actin 

NADH dehydrogenase 
Actin i 
Unknown 
Actin 

Chlorophyll a/b binding 
Phosphoglycerate kinase 
Gtoberellic acid biosynthesis 
Unknown 

G-box binding factor 1 
Elongation factor 
Aldolase 

G-box binding factor 2 
Chtoroptast protease 
Unknown 
Catalase 

Rat glucocorticoid receptor 
Unknown 
ATPase 

Homeobcoc-teucine zipper 1 
Light harvesting complex 
Unknown 

Homeobox-teucine zipper 2 
Homeobox-leuoine zipper 4 
PhosprK>ributokinase 
Homeobox-teucine zpper 5 
Uhknown 

Homeobox -leucine zipper 22 
Oxygen evolving 
Unknown 

Kootted-like homeobox 1 
RuBisCO small subunit 
Transiation elongation factor 
Protein phosphatase 1 
Unknown 

Chkxoplast protease 
Unknown 
Cyctophflin 
GTP binding 
Unknown 
Unknown 
Unknown 
Uhknown 
Synaptobrevin 
Ught harvesting complex 
Light harvesting complex 

Veast tryptophan biosynthesis 

b « Strata^ MJota. California). INo mat* n the database: novel EST. 



H36236 

227010 

M20016 

U36594T 

T45783 

M85150 

T44490 

L37126 

U36595t 

X63894 

X52256 

T04477 

X63895 

R87034 

T14152 

T22720 

Ml 4053 

U35596t 

J04185 

U09332 

T04063 

T 76267 

U09335 

M90394 

T04344 

M90416 

233675 

U09336 

T21749 

234607 

U14174 

X14564 

T42799 

U34803 

T44621 

T43698 

R65481 

L14844 

X59152 

233795 

T45276 . 

T13832 

R64816 

M90418 

218205 

X03909 

X04273 



observed for HAT4, luge changes In ex. 
pression were not observed for any of die 
°u*r 44 genes we examined. This was 
somewhat surprising, particularly because 
comparative analysis of leaf and root tissue 
identified 27 differentially expressed genes 
Analysis of an expanded set of genes may be 
required to identify genes whose expression 
changes upon HAT4 overexpression; alter- 
natively, a comparison of mRNA popula- 
tions from specific tissues of wild-type and 
HAH-transgenic piano may allow identi- 
fication of downstream genes. 

At the current density of robotic printing, 
it is feasible to scale up the fabrication pro- 
cess to produce arrays containing 20,000 
cDNA targets. At this density, a single array 
would be sufficient to provide gene^pecific 
targets ertconipassing nearly the entire rep- 
ertoire of expressed genes in the Araixdopsis 
genome (2). The availability of 20,274 ESTs 
from Arabidopsis (J, 9) would provide a rich 
source of templates for such studies. 

The estimated 100,000 genes in the hu- 
man genome (10) exceeds the number of 
Arabidopsis genes by a factor of 5 (2). This 
modest increase in complexity suggests that 
similar cDNA microarrays, prepared from 
the rapidly growing repertoire of human 
ESTs (J), could be used to determine the 
expression patterns of tens of thousands of 
human genes in diverse cell types. Coupling 
an amplification strategy to the reverse 
transcription reaction (J I) could make it 
feasible to monitor expression even in 
minute tissue samples. A wide variety of 
acute and chronic physiological and patho- 
logical conditions might lead to character- 
istic changes in the patterns of gene expres- 
sion in peripheral blood cells or other easily 
sampled tissues. In concert with cDNA mi- 
croarrays for monitoring complex expres- 
sion patterns, these tissues might therefore 
serve as sensitive in vivo sensors for clinical 
diagnosis. Microarrays of cDNAs could thus 
provide a useful link between human gene 
sequences and clinical medicine. 



Table 2. Gene expression rroritoring by microar- 
ray and RNA blot analyses; tg, HAT^transoenic 
See Table 1 for additional gene htormatorf^-" 
pression levels (w/w) were calibrated with the use 
of known amounts of human AChR mRNA. Values 
for the microarray were determined from mtcroar- 
ray scans (Rg. 1); values for the RNA blot were 
determined from RNA blots (Rg 2) 
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Gene 


Expression le\ 
Microa/ray 


✓eJfwAv) 
RNA blot 


CABI 
CABI (tg) 
HAT 4 
HAT4 (tg) 
ROCl 
ROC1 (tg) 


1:48 

1:120 

1:8300 

1:150 

1:1200 

1560 


1:83 

1:150 

1:6300 

1:210 

1:1800 

1:1300 
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Gene Therapy in Peripheral Blood 
Lymphocytes and Bone Marrow for 
ADA Immunodeficient Patients 

Claudio Bordignon,* Luigi D, Notarangelo, Nadia Nobili 
Giuhana Ferrari, Giulia Casorati, Paola Panina, Evelina Mazzolari, 
Daniela Maggioni, Claudia Rossi, Paolo Servida, 
Alberto G. Ugazio, Fulvio Mavilio 

Adenosine deaminase (ADA) deficiency results in severe combined Immunodeficiency, 
the first genetic disorder treated by gene therapy. Two different retroviral vectors were 
used to transfer ex vivo the human ADA minigene into bone marrow cells and peripheral 
Dlood lymphocytes from two patients undergoing exogenous enzyme replacement ther- 
apy. After 2 years of treatment, long-term survival of T and B lymphocytes, marrow ceils 
and granulocytes expressing the transferred ADA gene was demonstrated and resulted 
in i normalization of the immune repertoire and restoration of cellular and humoral immunity 
Atter discontinuation of treatment, T lymphocytes, derived from transduced peripheral" 
wood lymphocytes, were progressively replaced by marrow-derived T cells in both pa- 
tients. These results indicate successful gene transfer into long-lasting progenitor cells 
producing a functional multiiineage progeny. 



Severe combined immunodeficiency asso- 
ciated with inherited deficiency of ADA 

(1) is usually fatal unless affected children 
are kept in protective isolation or the im- 
mune system is reconstituted by bone mar- 
row transplantation from a human leuko- 
cyte antigen (HLAMdentical sibling donor 

(2) . This is the therapy of choice, although 
it is available only for a minority of patients. 
In recent years, other forms of therapy have 
been developed, including transplants from 
haploidentical donors (3,4), exogenous en- 
xyme replacement (5), and somatic-cell 
gene therapy (6-9). 

We previously reported a preclinical mod- 
el in which ADA gene transfer and expression 
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successfully restored immune functions in hu- 
man ADA-deficient (ADA") rxriphexal 
blood lymphocytes (PBLs) in inAmunodefi- 
cient mice in vivo (JO, i J J. On the »™i> of 
these preclinical results, the clinical applica- 
tion of gene merapy for the treatment of 
ADA" SOD (severe combined irniruinodefi- 
ciency disease) patients who previously failed 
exogenous enzyme replacement therapy was 
approved by our Institutional Ethical Com- 
mittees and by the Italian National Commit- 
tee for Bioethics (12). In addition to evaluat- 
ing the safety and efficacy of the gene therapy 
procedure, the aim of the study was to define 
the relative role of PBLs and hematopoietic 
stem cells in the long-term reconstiturion of 
immune functions after retroviral vector-me- 
diated ADA gene transfer. For this purpose, 
two structurally identical vectors expressing 
the human ADA complementary DNA 
(cDNA), distinguishable by the presence of 
alternative restriction sites in a nonfunctional 
region of the viral long-terminal repeat 
(LTR), were used to transduce PBLs and bone 
marrow (BM) cells independently. This pro- 
cedure allowed identification of the origin of 
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Differential gene expression in drug metabolism and 
toxicology: practicalities, problems and potential 

JOHN C. ROCKETTt, DAVID j; ESDAILE} 
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1 . An important feature of the work of many molecular biologists is identifying which 
genes are switched on and off in a cell under different environmental conditions or 
subsequent to xenobiotic challenge. Such information has many uses, including the 
deciphering of molecular pathways and facilitating the development of new experimental 
and diagnostic procedures. However, the student of gene hunting should be forgiven for 
perhaps becoming confused by the mountain of information available as there appears to be 
almost as many methods of discovering differentially expressed genes as there are research 
groups using the technique. 

2. The aim of this review was to clarify the main methods of differential gene expression 
analysis and the mechanistic principles underlying them. Also included is a discussion on 
some of the practical aspects of using this technique. Emphasis is placed on the so-called 
™Z ffi 1 ? 5 '!. n equire " n P ? 0r ^^dge of the genes contained within the study 
model. Whilst these wdl eventually be replaced by 'closed ' systems in the study of human 
mouse and other commonly studied laboratory animals, they will remain a powerful tool for 
those examining less fashionable models. 

3. The use of suppression-PCR subtractive hybridization is exemplified in the 
identification of up- and down-regulated genes in rat liver following exposure to pheno- 
barbital, a well-known inducer of the drug metabolizing enzymes. 

4. Differential gene display provides a coherent platform for building libraries and 
microchip arrays of 'gene fingerprints' characteristic of known enzyme inducers and 
xenobiotic toxicants, which may be interrogated subsequently for the identification and 
characterization of xenobiotics of unknown biological properties 



Introduction 

It is now apparent that the development of almost all cancers and many non- 
neoplastic diseases are accompanied by altered gene expression in the affected cells 
compared to their normal state (Hunter 1991, Wynford-Thomas 1991, Vogelstein 
and Kinzler 1 993, Semenza 1 994, Cassidy 1 995, Kleinjan and Van Hegningen 1 998) 
Such changes also occur in response to external stimuli such as pathogenic micro- 
organisms (Rohn et al. 1996, Singh et al. 1997, Griffin and Krishna 1998, Lunney 
lllll xenobiotics < Sewa " * **• 1995, Dogra et al. 1998, Ramana and Kohli 
1998), as well as during the development of undifferentiated cells (Hecht 1998 
Rudin and Thompson 1998, Schneider-Maunoury et al. 1998). The potential' 
medical and therapeutic benefits of understanding the molecular changes which 
occur in any given cell in progressing from the normal to the 'altered' state are 
enormous. Such profiling essentially provides a 'fingerprint' of each step of a 
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cell's development or response and should help in the elucidation of specific and 
sensitive biomarkers representing, for example, different types of cancer or previous 
exposure to certain classes of chemicals that are enzyme inducers. 

In drug metabolism, many of the xenobiotic-metabolizing enzymes (including 
the well-characterized isoforms of cytochrome P450) are inducible by drugs and 
chemicals in man (Pelkonen et al. 1998), predominantly involving transcriptional 
activation of not only the cognate cytochrome P450 genes, but additional cellular 
proteins which may be crucial to the phenomenon of induction. Accordingly, the 
development of methodology to identify and assess the full complement of genes 
that are either up- or down-regulated by inducers are crucial in the development of 
knowledge to understand the precise molecular mechanisms of enzyme induction 
and how this relates to drug action. Similarly, in the field of chemical-induced 
toxicity, it is now becoming increasingly obvious that most adverse reactions to 
drugs and chemicals are the result of multiple gene regulation, some of which are 
causal and some of which are casually-related to the toxicological phenomenon per 
se. This observation has led to an upsurge in interest in gene-profiling technologies 
which differentiate between the control and toxin-treated gene pools in target tissues 
and is, therefore, of value in rationalizing the molecular mechanisms of xenobiotic- 
mduced toxicity. Knowledge of toxin-dependent gene regulation in target tissues is 
not solely an academic pursuit as much interest has been generated in the 
pharmaceutical industry to harness this technology in the early identification of toxic 
drug candidates, thereby shortening the developmental process and contributing 
substantially to the safety assessment of new drugs. For example, if the gene profile 
in response to say a testicular toxin that has been well-characterized in vivo could be 
determined in the testis, then this profile would be representative of all new drug 
candidates which act via this specific molecular mechanism of toxicity, thereby 
providing a useful and coherent approach to the early detection of such toxicants. 
Whereas it would be informative to know the identity and functionality of all genes 
up/down regulated by such toxicants, this would appear a longer term goal, as the 
majority of human genes have not yet been sequenced, far less their functionality 
determined. However, the current use of gene profiling yields a pattern of gene 
changes for a xenobiotic of unknown toxicity which may be matched to that of well- 
characterized toxins, thus alerting the toxicologist to possible m vivo similarities 
between the unknown and the standard, thereby providing a platform for more 
extensive toxicological examination. Such approaches are beginning to gain 
momentum, in that several biotechnology companies are commercially producing 
'gene chips' or *gene arrays' that may be interrogated for toxicity assessment of 
xenobiotics. These chips consist of hundreds/thousands of genes, some of which are 
degenerate in the sense that not all of the genes are mechanistically-related to any 
one toxicological phenomenon. Whereas these chips are useful in broad-spectrum 
screening, they are maturing at a substantial rate, in that gene arrays are now 
becoming more specific, e.g. chips for the identification of changes in growth factor 
families that contribute to the aetiology and development of chemically-induced 
neoplasias. 

Although documenting and explaining these genetic changes presents a 
formidable obstacle to understanding the different mechanisms of development and 
disease progression, the technology is now available to begin attempting this difficult 
challenge. Indeed, several 'differential expression analysis' methods have been 
developed which facilitate the identification of gene products that demonstrate 
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altered expression in cells of one population compared to another. These methods 
have been used to identify differential gene expression in many situations, including 
invading pathogenic microbes (Zhao et al. 1998), in cells responding to extracellular 
and intracellular microbial invasion (Diiguid and Dinauer 1990, Ragno et al. 1997, 
Maldarelli et al. 1998), in chemically treated cells (Syed et al. 1997, Rockett et al. 
1999), neoplastic cells (Liang et al. 1992, Chang and Terzaghi-Howe 1998), 
activated cells (Gurskaya et al. 1996, Wan et al. 1996), differentiated cells (Hara et 
al. 1991, Guimaraes et al. 1995a, b), and different cell types (Davis et al. 1984 
Hedrick et al. 1984, Xhu et al. 1998). Although differential expression analysis 
technologies are applicable to a broad range of models, perhaps their most important 
advantage is that, in most cases, absolutely no prior knowledge of the specific genes 
which are up- or down-regulated is required. 

The field of differential expression analysis is a large and complex one, with 
many techniques available to the potential user. These can be categorized into 
several methodological approaches, including: 

(1) Differential screening, 

(2) Subtractive hybridization (SH) (includes methods such as chemical cross- 
linking subtraction— CCLS, suppression-PCR subtractive hybridization— 
SSH, and representational difference analysis— RDA), 

(3) Differential display (DD), 

(4) Restriction endonuclease facilitated analysis (including serial analysis of gene 
expression— SAGE— and gene expression fingerprinting— GEF), 

(5) Gene expression arrays, and 

(6) Expressed sequence tag (EST) analysis. 

The above approaches have been used successfully to isolate differentially 
expressed genes in different model systems. However, each method has its own 
subtle (and sometimes not so subtle) characteristics which incur various advantages 
and disadvantages. Accordingly, it is the purpose of this review to clarify the 
mechanistic principles underlying the main differential expression methods and to 
highlight some of the broader considerations and implications of this very powerful 
and increasingly popular technique. Specifically, we will concentrate on the so- 
called 'open' systems, namely those which do not require any knowledge of gene 
sequences and, therefore, are useful for isolating unknown genes. Two 'closed' 
systems (those utilising previously identified gene sequences), EST analysis and the 
use of DNA arrays, will also be considered briefly for completeness. Whilst 
emphasis will often be placed on suppression PCR subtractive hybridization (SSH, 
the approach employed in this laboratory), it is the aim of the authors to highlight,' 
wherever possible, those areas of common interest to those who use, or intend to use! 
differential gene expression analysis. 



Differential cDNA library screening (DS) 

Despite the development of multiple technological advances which have recently 
brought the field of gene expression profiling to the forefront of molecular analysis, 
recognition of the importance of differential gene expression and characterization of 
differentially expressed genes has existed for many years. One of the original 
approaches used to identify such genes was described 20 years ago by St John and 
Davis (1979). These authors developed a method, termed 'differential plaque filter 
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hybridization', which was used to isolate galactose-inducible DNA sequences from 
yeast. The theory is simple: a genomic DNA library is prepared from normal, 
unstimulated cells of the test organism /tissue and multiple filter replicas are 
prepared. These replica blots are probed with radioactively (or otherwise) labelled 
complex cDNA probes prepared from the control and test cell mRNA populations. 
Those mRNAs which are differentially expressed in the treated cell population will 
show a positive signal only on the filter probed with cDNA from the treated cells. 
Furthermore, labelled cDNA from different test conditions can be used to probe 
multiple blots, thereby enabling the identification of mRNAs which are only up- 
regulated under certain conditions. For example, St John and Davis (1979) screened 
replica filters with acetate-, glucose- and galactose-derived probes in order to obtain 
genes induced specifically by galactose metabolism. Although groundbreaking in its 
time this method is now considered insensitive and time-consuming, as up to 2 
months are required to complete the identification of genes which are differentially 
expressed in the test population. In addition, there is no convenient way to check 
that the procedure has worked until the whole process has been completed. 



Subtractive Hybridization (SH) 

The developing concept of differentia] gene expression and the success of early 
approaches such as that described by St John and Davis (1979) soon gave rise to a 
search for more convenient methods of analysis. One of the first to be developed was 
SH, numerous variations of which have since been reported (see below). In general, 
this approach involves hybridization of mRNA /cDNA from one population (tester) 
to excess mRNA/cDNA from another (driver), followed by separation of the 
unhybridized tester fraction (differentially expressed) from the hybridized common 
sequences. This step has been achieved physically, chemically and through the use 
of selective polymerase chain reaction (PCR) techniques. 



Physical separatum 

Original subtractive hybridization technology involved the physical separation 
of hybridized common species from unique single stranded species. Several methods 
of achieving this have been described, including hydroxyapatite chromatography 
(Sargent and Dawid 1983), avidin-biotin technology (Duguid and Dinauer 1990) 
and oligodT-latex separation (Hara et al. 1991). In the first approach, common 
mRNA species are removed by cDNA (from test cells)-mRNA (from control cells) 
subtractive hybridization followed by hydroxyapatite chromatography, as hydroxy- 
apatite specifically adsorbs the cDNA-mRNA hybrids. The unabsorbed cDNA is 
then used either for the construction of a cDNA library of differentially expressed 
genes (Sargent and Dawid 1983, Schneider et al. 1988) or directly as a probe to 
screen a preselected library (Zimmerman et al. 1980, Davis et al. 1 984, Hedrick et al. 
1984). A schematic diagram of the procedure is shown in figure 1. 

Less rigorous physical separation procedures coupled with sensitivity enhancing 
PCR steps were later developed as a means to overcome some of the problems 
encountered with the hydroxyapatite procedure. For example, Daguid and Dinauer 
(1990) described a method of subtraction utilizing biotin- affinity systems as a means 
to remove hybridized common sequences. In this process, both the control and 
tester mRNA populations are first converted to cDN A and an adaptor (' oligovector ', 
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Control (driver) mRNA Tester (test) cDNA (1st strand) 



-AAAA TTTTT 

-AAAA TTTTT 

"AAAA TTTTT 

-AAAA TTTTT 



Mix (ratio >35:1)& hybridize 

•AAAA 



AAAA 

-AAAA AAAA 



Hydroxyapatite chromatography — > RNAxDNA hybrids removed 



Unhybridized 

cDNA (differentially expressed) t \\V 

and mRNA **** 

AAAA 



I 

Sepharose CL6B exclusion — > Small cDNA fragments (<450bp) 
chromatography 

i 

Enriched, differentially expressed cDNA 
or 

Produce clones Label directly and probe library 

Figure 1. The hydroxyapatite method of subtractive hybridization. cDNA derived from the 
treated /altered (tester) population is mixed with a large excess of mRNA from the control (driver) 
population. Following hybridization, mRNA-cDNA hybrids are removed by hydroxyapatite 
chromatography. The only cDNAs which remain are those which are differentially expressed in 
the treated/altered population. In order to facilitate the recovery of full length clones, small cDNA 
fragments are removed by exclusion chromatography. The remaining cDN As are then cloned into 
a vector for sequencing, or labelled and used directly to probe a library, as described by Sargent 
and Dawid(1983). 



containing a restriction site) ligated to both sides. Both populations are then 
amplified by PCR, but the driver cDNA population is subsequently digested with 
the adaptor-containing restriction endonuclease. This serves to cleave the oligo- 
vector and reduce the amplification potential of the control population. The digested 
control population is then biotinylated and an excess mixed with tester cDNA. 
Following denaturation and hybridization, the mix is applied to a biocytin column 
(streptavidin may also be used) to remove the control population, including 
heteroduplexes formed by annealing of common sequences from the tester 
population. The procedure is repeated several times following the addition of fresh 
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Control (driver) mRNA 



■AAAA 
■AAAA 



Test (tester) mRNA 

AAAA 

AAAA 



I 



Anneal mRNA to poiydTa latex beads 



AAAA- 



1 



cDNA synthesis 



Mix and anneal 



T 



AAAA- 



AAAA 



AAAA 



Centrifuge beads, collect and store supernatant, 
dissociate polyA, reapply supernatant 

AAAA Tester-specific mRNA retrieved after 

AAAA — 4 rounds of hybridization 
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Sequence inserts and/or carry out 
other downstream applications 

Figure 2. The use of oligodT^ latex to perform subtractive hybridization. mRNA extracted from the 
control (driver) population is converted to anchored cDNA using polydT oligonucleotides 
attached to latex beads. mRNA from the treated/altered (tester) population is repeatedly 
hybridized against an excess of the anchored driver cDNA. The final population of mRNA is 
tester specific and can be converted into cDNA for cloning and other downstream applications, as 
described by Hara et aL (1991). 
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control cDNA. In order to further enrich those species differentially expressed in 
the tester cDNA, the subtracted tester population is amplified by PCR following 
every second subtraction cycle. After six cycles of subtraction (three reamplification 
steps) the reaction mix is ligated into a vector for further analysis. 

Jn a slightly different approach, Hara et al (1991) utilized a method whereby 
oligotdTgJ primers attached to a latex substrate are used to first capture mRNA 
extracted from the control population. Following 1st strand cDNA synthesis, the 
RNA strand of the heteroduplexes is removed by heat denaturation and centri- 
fugation (the cDNA-oligotex-dT^ forms a pellet and the supernatant is removed). 
A quantity of tester mRNA is then repeatedly hybridized to the immobilized control 
(driver) cDNA (which is present in 20-fold excess). After several rounds of 
hybridization the only mRNA molecules left in the tester mRNA population are 
those which are not found in the driver cDNA-oligotex-dT^ population. These 
tester-specific mRNA species are then converted to cDNA and, following the 
addition of adaptor sequences, amplified by PCR. The PCR products are then 
ligated into a vector for further analysis using restriction sites incorporated into the 
PCR primers. A schematic illustration of this subtraction process is shown in figure 
2. 

However, all these methods utilising physical separation have been described as 
inefficient due to the requirement for large starting amounts of mRNA, significant 
loss of material during the separation process and a need for several rounds of 
hybridization. Hence, new methods of differential expression analysis have recently 
been designed to eliminate these problems. 

Chemical Cross-Linking Subtraction (CCLS) 

In this technique, originally described by Hampson et al. (1992), driver mRNA 
is mixed with tester cDNA (1st strand only) in a ratio of >20:1. The common 
sequences form cDNA:mRNA hybrids, leaving the tester specific species as single 
stranded cDNA. Instead of physically separating these hybrids, they are inactivated 
chemically using 2,5 diaziridinyl-1 ,4-benzoquinone (DZQ). Labelled probes are 
then synthesized from the remaining single stranded cDNA species (unreacted 
mRNA species remaining from the driver are not converted into probe material due 
to specificity of Sequenase T7 DNA polymerase used to make the probe) and used 
to screen a cDNA library made from the tester cell population. A schematic diagram 
of the system is shown in figure 3. 

It has been shown that the differentially expressed sequences can be enriched at 
least 300-fold with one round of subtraction (Hampson et al. 1992), and that the 
technique should allow isolation of cDN As derived from transcripts that are present 
at less than 50 copies per cell. This equates to genes at the low end of intermediate 
abundance (see table 1). The main advantages of the CCLS approach are that it is 
rapid, technically simple and also produces fewer false positives than other 
differential expression analysis methods. However, like the physical separation 
protocols, a major drawback with CCLS is the large amount of starting material 
required (at least 10 ^g RNA). Consequently, the technique has recently been 
refined so that a renewable source of RNA can be generated. The degenerate random 
oligonucleotide primed (DROP) adaptation (Hampson et al. 1996, Hampson and 
Hampson 1997) uses random hexanucleotide sequences to prime solid phase- 
synthesized cDNA. Since each primer includes a T7 polymerase promotor sequence 
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Control (driver) mRNA 



-AAAA 
-AAAA 



mRNAxDNA hybrids 



Test (tester) mRNA 



"AAAA 
"AAAA 



1st strand cDNA synthesis . 
followed by alkaline hydrolysis J, 



Mix and anneal 



-AAAA 



Unique cDNA species 



-TTTT 



Cross linking agent 
(OZQ) added 



Hybrids are cross-linked xxxxxxxxx AAAA 



+ 



Probes synthesised from single stranded cDNA 
species and used to probe cDNA library 



Figure 1 Chemical cross-linking subtraction. Excess driver mRNA is mixed with 1* strand tester 
<! DNA ; .^e common sequences form mRNA:cDNA hybrids which are cross linked with 2,5 
diaziridinyM,4-benzoquinone (DZQ) and the remaining cDNA sequences are differentially 
expressed m the tester population. Probes are made from these sequences using Sequenase 2 0 
DNA polymerase, which lacks reverse transcriptase activity and, therefore, does not react with the 
remaining mRNA molecules from the driver. The labelled probes are then used to screen a cDNA 
library for clones of differentially expressed sequences. Adapted from Walter et al (1996) with 
permission. " 



Tabl e 1. The abundance of mRNA species and classes in a typical mammalian cell. 

Mean mass 

Copies of No. of mRNA Mean % of (ng) of each 
mRNA each species in each species species /pg 
£1S2 species/cell class in class total RNA 

Abundant 12000 4 3.3 1,65 

Intermediate 300 500 0.08 0.04 

Rare 15 11000 0.004 0*002 



Modified from Bertioli et al. (1995). 
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at the 5 'end, the final pool of random cDNA fragments is a PCR-renewable cDNA 
population which is representative of the expressed gene pool and can be used to 
synthesize sense RNA for use as driver material. Furthermore, if the final pool of 
random cDNA fragments is reamplified using biotinylated T7 primer and random 
hexamer, the product can be captured with streptavidin beads and the antisense 
strand eluted for use as tester. Since both target and driver can be generated from 
the same DROP product, subtraction can be performed in both directions (i.e. for 
up- and down-regulated species) between two different DROP products. 

Representational Difference Analysis (RDA) 

RDA of cDNA (Hubank and Schatz 1994) is an extension of the technique 
originally applied to genomic DNA as a means of identifying differences between 
two complex genomes (Lisitsyn et al. 1993). It is a process of subtraction and 
amplification involving subtractive hybridization of the tester in the presence of 
excess driver. Sequences in the tester that have homologues in the driver are 
rendered unamplifiable, whereas those genes expressed only in the tester retain the 
ability to be amplified by PCR. The procedure is shown schematically in figure 4. 

In essence. the driver and tester mRNA populations are first converted to cDNA 
and amplified by PCR following the ligation of an adaptor. The adaptors are then 
removed from both populations and a new (different) adaptor ligated to the 
amplified tester population only. Driver and tester populations are next melted and 
hybridized together in a ratio of 100 :1 . Following hybridization, only tester:tester 
homohybrids have 5 'adaptors at each end of the DNA duplex and can, thus, be filled 
in at both 3' ends. Hence, only these molecules are amplified exponentially during 
the subsequent PCR step. Although tester: driver heterohybrids are present, they 
only amplify in a linear fashion, since the strand derived from the driver has no 
adaptor to which the primer can bind. Driver : driver heterohybrids have no 
adaptors and, therefore, are not amplified. Single stranded molecules are digested 
with mung bean nuclease before a further PCR-enrichment of the tester:tester 
homohybrids. The adaptors on the amplified tester population are then replaced and 
the whole process repeated a further two or three times using an increasing excess of 
driver (Hubank and Shatz used a tester :driver ratio of 1:400, 1:80000 and 
1:800000 for the second, third and fourth hybridizations, respectively). Different 
adaptors are ligated to the tester between successive rounds of hybridization and 
amplification to prevent the accumulation of PCR products that might interfere with 
subsequent amplifications. The final display is a series of differentially expressed 
gene products easily observable on an ethidium bromide gel. 

The main advantages of RDA are that it offers a reproducible and sensitive 
approach to the analysis of differentially expressed genes. Hubank and Schatz (1994) 
reported that they were able to isolate genes that were differentially expressed in 
substantially less than 1 % of the cells from which the tester is derived. Perhaps the 
main drawback is that multiple rounds of ligation, hybridization, amplifiation and 
digestion are required. The procedure is, therefore, lengthier than many other 
differential display approaches and provides more opportunity for operator-induced 
error to occur. Although the generation of false positives has been noted, this has 
been solved to some degree by O'Neill and Sinclair (1 997) through the use of HPLC- 
purified adaptors. These are free of the truncated adaptors which appear to be a 
major source of the false positive bands. A very similar technique to RDA, termed 
linker capture subtraction (LCS) was described by Yang and Sytowski (1996). 
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Figure 4. The representational difference analysis (RDA) technique. Driver and tester cDNA are 
digested with a 4-cutter restriction enzyme such as DpnU. The l* 1 set of 12/24 adaptor strands 
(oligonucleotides) are ligated to each other and the digested cDNA products. The 12mer is 
subsequently melted away and the 3'ends filled in using Taq DNA polymerase Each cDNA 
population is then amplified using PCR, following which the V x set of adaptors is removed with 
DpnU. A second set of 12/24 adaptor strands is then added to the amplified tester cDNA 
population, after which the tester is hybridized against a large excess of driver. The 12mer 
adaptors are melted and the 3' ends filled in as before. PCR is carried out with primers identical 
to the new 24mer adaptor. Thus, the only hybridization products which are exponentially 
amplified are those which are tester .- tester combinations. Following PCR, ssDNA products are 
removed with mung bean nuclease, leaving the 'first difference product*. This is digested and a 
third set of 12/24 adaptors added before repeating the subtraction process from the hybridization 
stage. The process is repeated to the 3' d or 4 th difference product, as described by Lisitsyn et aL 
(1993) and Hubank and Schatz (1994). 
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Suppression PCR Subtractive Hybridization (SSH) 

The most recent adaptation of the SH approach to differential expression 
analysis was first described by Diatchenko et al (1996) and Gurskaya et al. (1996). 
They reported that a 1000-5000 fold enrichment of rare cDNAs (equivalent to 
isolating mRNAs present at only a few copies per cell) can be obtained without the 
need for multiple hybridizations /subtractions. Instead of physical or chemical 
removal of the common sequences, a PCR-based suppression system is used (see 
figure 5). 

In SSH, excess driver cDNA is added to two portions of the tester cDNA which 
have been ligated with different adaptors. A first round of hybridization serves to 
enrich differentially expressed genes and equalize rare and abundant messages. 
Equalization occurs since reannealing is more rapid for abundant molecules than for 
rarer molecules due to the second order kinetics of hybridization (James and Higgins 
1 985). The two primary hybridization mixes are then mixed together in the presence 
of excess driver and allowed to hybridize further. This step permits the annealing of 
single stranded complementary sequences which did not hybridize in the primary 
hybridization, and in doing so generates templates for PCR amplification. Although 
there are several possible combinations of the single stranded molecules present in 
the secondary hybridization mix, only one particular combination (differentially 
expressed in the tester cDNA composed of complimentary strands having different 
adaptors) can amplify exponentially. 

Having obtained the final differential display, two options are available if cloning 
of cDNAs is desired. One is to transform the whole of the final PCR reaction into 
competent cells. Transformed colonies can then be isolated and their inserts 
characterized by sequencing, restriction analysis or PCR. Alternatively, the final 
PCR products can be resolved on a gel and the individual bands excised, reamplified 
and cloned. The first approach is technically simpler and less time consuming. 
However, ligation/transformation reactions are known to be biased towards the 
cloning of smaller molecules, and so the final population of clones will probably not 
contain a representative selection of the larger products. In addition, although 
equalization theoretically occurs, observations in this laboratory suggest that this is 
by no means perfectly accomplished. Consequently, some gene species are present 
in a higher number than others and this will be represented in the final population 
of clones. Thus, in order to obtain a substantial proportion of those gene species that 
actually demonstrate differential expression in the tester population, the number of 
clones that will have to be screened after this step may be substantial. The second 
approach is initially more time consuming and technically demanding. However, it 
would appear to offer better prospects for cloning larger and low abundance gel 
products. In addition, one can incorporate a screening step that differentiates 
different products of different sequences but of the same size (HA-staining, see 
later). In this way, a good idea of the final number of clones to be isolated and 
identified can be achieved. 

An alternative (or even complementary) approach is to use the final differential 
display reaction to screen a cDNA library to isolate full length clones for further 
characterization, or a DNA array (see later) to quickly identify known genes. SSH 
has been used in this laboratory to begin characterization of the short-term gene 
expression profiles of enzyme-inducers such as phenobarbital (Rockett et al. 1997) 
and Wy-14,643 (Rockett et al. unpublished observations). The isolation of 
differentially expressed genes in this manner enables the construction of a fingerprint 
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Figure 5. PCR-select cDNA subtraction. In the primary hybridization, an excess of driver cDNA is 
added to each tester cDNA population. The samples are heat denatured and allowed to hybridize 
for between 3 and 8 h. This serves two purposes : (1 ) to equalize rare and abundant molecules ; and 
(2) to enrich for differentially expressed sequences— cDNAs that are not differentially expressed 
form type c molecules with the driver. In the secondary hybridization, the two primary 
hybridizations are mixed together without denaturing. Fresh denatured driver can also be added 
at this point to allow further enrichment of differentially expressed sequences. Type e molecules 
are formed in this secondary hybridization which are subsequently amplified using two rounds of 
PCR. The final products can be visualized on an agarose gel, labelled directly or cloned into a 
vector for downstream manipulation. As described by Diatchenko et al. (1996) and Gurskaya 
et al. (1996), with permission. 
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Figure 6. Flow diagram showing method used in this laboratory to isolate and identify clones of genes 
which are differentially expressed in rat liver following short term exposure to the enzyme 
inducers, phenobarbital and Wy-14,643. 



of expressed genes which are unique to each compound and time /dose point. Such 
information could be useful in short-term characterization of the toxic potential of 
new compounds by comparing the gene-expression profiles they elicit with those 
produced by known inducers. Figure 6 shows a flow diagram of the method used to 
isolate, verify and clone differentially expressed genes, and figure 7 shows expression 
profiles obtained from a typical SSH experiment. Subsequent sub-cloning of the 
individual bands, sequencing and gene data base interrogation reveals many genes 
which are either up- or down-regulated by phenobarbital in the rat (tables 2 and 3). 

One of the advantages in using the SSH approach is that no prior knowledge is 
required of which specific genes are up/down-regulated subsequent to xenobiotic 
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Figure 7. SSH display patterns obtained from rat liver following 3-day treatment with WY-14 643 or 
phenobarbital. mRNA extracted from control and treated livers was used to generate the 
differential displays using the PCR-Select cDNA subtraction kit (Clontech) Lane- 1— lkb 
w d ?I^ 8eneS u P re & ulated ^Mowing Wy,14-643 treatment; 3-genes downregulated following 
Wy,14-643 treatment; 4-genes upregulated following phenobarbital treatment; 5-genes 

?S , Wm6 Phenobarbital treatment; 6-lkb ladder. Reproduced from Rockett et 
al. (1VV7J, with permission. 

exposure, and an almost complete complement of genes are obtained. For example, 
the peroxisome proliferator and non-genotoxic hepatocarcinogen Wy.14,643, up- 
regulates at least 28 genes and down-regulates at least 15 in the rat' (a 'sensitive 
species) and produces 48 up- and 37 down-regulated genes in the guinea pig, a 
resistant species (Rockett, Swales, Esda and Gibson, unpublished observations). 
One of these genes, CD81, was up-regulated in the rat and down-regulated in the 
guinea pig following Wy-14,643 treatment. CD81 (alternatively named TAPA-1) is 
a widely expressed cell surface protein which is involved in a large number of cellular 
processes including adhesion, activation, proliferation and differentiation (Levy et 
al. 1998). Since all of these functions are altered to some extent in the phenomena 
of hepatomegaly and non-genotoxic hepatocarcinogenesis, it is intriguing, and 
probably mechanistically-relevant, that CD81 expression is differentially regulated 
in a resistant and susceptible species. However, the down-side of this approach is 
that the majority of genes can be sequenced and matched to database sequences, but 
the latter are predominantly expressed sequence tags or genes of completely 
unknown function, thus partially obscuring a realistic overall assessment of the 
critical genes of genuine biological interest. Notwithstanding the lack of complete 
funtional identification of altered gene expression, such gene profiling studies 
essentially provides a 'molecular fingerprint' in response to xenobiotic challenge, 
thereby serving as a mechanistically-relevant platform for further detailed 
investigations. 

Differential Display (DD) 

Originally described as 'RNA fingerprinting by arbitrarily primed PCR ' (Liang 
and Pardee 1992) this method is now more commonly referred to as 'differential 
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Table 2. Genes up-regulated in rat liver following 3-day exposure to phenobarbital. 
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Highest sequence 




size in bp) 


similarity 


FASTA-EMBL gene identificati n 


5 (13UU; 






m /4 f\f\f\\ 

7 (1000) 


nc 1 0/ 


Preproalbumin 






Serum albumin mRNA 


8 (950) 


98.3% 


NCI-CGAP-Prl ff. wpuwj (EST) 


10(850) 


95.7% 


CYP2B1 


11 (800) 


4 f\ A f\ fit 

Clone l 94.9% 


CYP2B1 




Clone 2 75.3% 


CYP2B2 


12 (750) 


93.8% 


TRPM-2 mRNA 






Sulfated glycoprotein 


15(600) 


92.9% 


Preproalbumin 






Serum albumin mRNA 


16(55) 


Clone l 95.2% 


CYP2B1 




Clone 2 93.6% 


Haptoglobulin mRNA partial alpha 


21 (350) 


99.3% 


18S, 5.8S & 28S rRNa 



Bands 1-4, 6, 9, 13, 14, and 17-20 are shown to be false positives by dot blot anaylsis and, therefore, 
are not sequenced. Derived from Rockett et al. (1997). It should be noted that the above genes do not 
represent the complete spectrum of genes which are up-regulated in rat liver by phenobarbital, but 
simply represents the genes sequenced and identified to date. 



Table 3. 


Genes down-regulated in rat liver following 3-day exposure to phenobarbital. 


Band number 






(approximate 


Highest sequence 




size in bp) 


similarity 


FASTA-EMBL gene identification 


1 (1500) 


95.3% 


3-oxoacyl-CoA thiolase 


2 (1200) 


92.3% 


Hemopoxin mRNA 


3 (1000) 


91.7% 


Alpha-2u-globulin mRNA 


7 (700) 


Clone 1 77.2% 


M.musculus CI inhibitor 




Clone 2 94.5% 


Electron transfer flavoprotein 




Clone 3 91.0% 


M. musculus Topoisomerase 1 (Topo 1) 


8 (650) 


Clone 1 86.9% 


Soares 2NbMT M. musculus (EST) 




Clone 2 96.2% 


Alpha-2u-globulin (s-type) mRNA 


9 (600) 


Clone 1 86.9% 


Soares mouse NML M, musculus (EST) 




Clone 2 82.0% 


Soares p3NMF 19.5 M. musculus (EST) 


10 (550) 


73.8% 


Soares mouse NML M. musculus (EST) 


11 (525) 


95.7% 


NCI-CGAP-Prl H. sapiens (EST) 


12 (375) 


100.0% 


Ribosomal protein 


13 (23) 


Clone 1 97.2% 


Soares mouse embryo NbME135 (EST) 




Clone 2 100.0% 


Fibrinogen B-beta-chain 




Clone 3 100.0% 


Apolipoprotein E gene 


14 (170) 


96.0% 


Soares p3NMF19.5 M. musculus (EST) 


15 (140) 


97.3% 


Stratagene mouse testis (EST) 


Others: (300) 


96.7% 


R. norvegicus RASP 1 mRNA 


(275) 


93.1% 


Soares mouse mammary gland (EST) 



EST = Expressed sequence tag. Bands 4-6 were shown to be false positives by dot blot analysis and, 
therefore, were not sequenced. Derived from Rock en et al. (1997). It should be noted that the above genes 
do not represent the complete spectrum of genes which are down-regulated in rat liver by phenobarbital, 
but simiply represents the genes sequenced and identified to date. 



display' (DD). In this method, all the mRNA species in the control and treated cell 
populations are amplified in separate reactions using reverse transcriptase-PCR 
(RT-PCR). The products are then run side-by-side on sequencing gels. Those 
bands which are present in one display only, or which are much more intense in one 
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display compared to the other, are differentially expressed and may be recovered for 
further characterization. One advantage of this system is the speed with which it can 
be carried out — 2 days to obtain a display and as little as a week to make and identify 
clones. 

Two commonly used variations are based on different methods of priming the 
reverse transcription step (figure 8). One is to use an oligo dT with a 2-base 'anchor' 
at the 3'-end, e.g. 5' (dT n )CA 3' (Liang and Pardee 1992). Alternatively, an 
arbitrary primer may be used for 1st strand cDNA synthesis (Welsh et aL 1992). 
This variant of RNA fingerprinting has also been called 'RAP' (RNA Arbitrarily 
Primed)-PCR. One advantage of this second approach is that PCR products may be 
derived from anywhere in the RNA, including open reading frames. In addition, it 
can be used for mRNAs that are not polyadenylated, such as many bacterial mRNAs 
(Wong and McClelland 1994). In both cases, following reverse transcription and 
denaturation, second strand cDNA synthesis is carried out with an arbitrary primer 
{arbitrary primers have a single base at each position, as compared to random 
primers, which contain a mixture of all four bases at each position). The resulting 
PCR, thus, produces a series of products which, depending on the system (primer 
length and composition, polymerase and gel system), usually includes 50-100 
products per primer set (Band and Sager 1989). When a combination of different 
dT-anchors and arbitrary primers are used, almost all mRNA species from a cell can 
be amplified. When the cDNA products from two different populations are analysed 
side by side on a polyacrylamide gel, differences in expression can be identified and 
the appropriate bands recovered for cloning and further analysis. 

Although DD is perhaps the most popular approach used today for identifying 
differentially expressed genes, it does suffer from several perceived disadvantages: 

(1) It may have a strong bias towards high copy number mRNAs (Bertioli et al. 
1995), although this has been disputed (Wan et al. 1996) and the isolation of very 
low abundance genes may be achieved in certain circumstances (Guimeraes et 
aL 1995a). 

(2) The cDNAs obtained often only represent the extreme 3' end of the mRNA 
(often the 3 '-untranslated region), although this may not always be the case 
(Guimeraes et aL 1995a). Since the 3'end is often not included in Genbank and 
shows variation between organisms, cDNAs identified by DD cannot always be 
matched with their genes, even if they have been identified. 

(3) The pattern of differential expression seen on the display often cannot be 
reproduced on Northern blots, with false positives arising in up to 70% of cases 
(Sun et aL 1994). Some adaptations have been shown to reduce false positives, 
including the use of two reverse transcriptases (Sung and Denman 1997), 
comparison of uninduced and induced cells over a time course (Burn et aL 1 994) 
and comparison of DDPCR-products from two uninduced and two induced 
lines (Sompayrac et aL 1995). The latter authors also reported that the use of 
cytoplasmic RNA rather then total RNA reduces false positives arising from 
nuclear RNA that is not transported to the cytoplasm. 

Further details of the background, strengths and weaknesses of the DD 
technique can be obtained from a review by McClelland et aL (1996) and from 
articles by Liang et aL (1995) and Wan et aL (1996). 
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mRNA 



(dTn)CA: AC 




-AAAAAAAA 

Arbitrary primer. 



1 st strand cDNA 
4 AC 



1 st strand cDNA 
< 



-UGAAAAAAA 



"AAAAAAA 



Denature and synthesise 2 nd strand 
with any arbitrary primer ( ) 



2 ftd strand cDNA 



2 nd strand cDNA 
► 



cDNA can now be amplified by PCR using original primer pair 

Figure 8. Two approaches to differential display (DD) analysis. 1 st strand synthesis can be carried out 
either with a polydT n NN primer (where N = G, C or A) or with an arbitrary primer. The use of 
different combinations of G, C and A to anchor the first strand polydT primer enables the priming 
of the majority of polyadenylated mRNAs. Arbitrary primers may hybridize at none, one or more 
places along the length of the mRNA, allowing l sl strand cDNA synthesis to occur at none, one 
or more points in the same gene. In both cases, 2 nd strand synthesis is carried out with an arbitrary 
primer. Since these arbitrary primers for the 2 nd strand may also hybridize to the 1* strand cDNA 
in a number of different places, several different 2 nd strand products may be obtained from one 
binding point of the l sl strand primer. Following 2 nd strand synthesis, the original set of primers 
is used to amplify the second strand products, with the result that numerous gene sequences are 
amplified. 



Restriction endonuclease-facilitated analysis of gene expression 

Serial Analysis of Gene Expression ( SAGE) 

A more recent development in the field of differential display is SAGE analysis 
(Velculescu et al. 1995). This method uses a different approach to those discussed so 
far and is based on two principles. Firstly, in more than 95% of cases, short 
nucleotide sequences ('tags') of only nine or 10 base pairs provide sufficient 
information to identify their gene of origin. Secondly, concatenation (linking 
together in a series) of these tags allows sequencing of multiple cDNAs within a 
single clone. Figure 9 shows a schematic representation of the SAGE process. In this 
procedure, double stranded cDNA from the test cells is synthesized with a 
biotinylated polydT primer. Following digestion with a commonly cutting (4bp 
recognition sequence) restriction enzyme ('anchoring enzyme'), the 3' ends of the 
cDNA population are captured with streptavidin beads. The captured population is 
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split into two and different adaptors ligated to the 5 'ends of each group. Incorporated 
into the adaptors is a recognition sequence for a type IIS restriction enzyme— one 
which cuts DNA at a defined distance (< 20 bp) from its recognition sequence. 
Hence, following digestion of each captured cDNA population with the IIS enzyme, 
the adaptors plus a short piece of the captured cDNA are released. The two 
populations are then ligated and the products amplified. The amplified products are 
cleaved with the original anchoring enzyme, religated (concatomers are formed in 
the process) and cloned. The advantage of this system is that hundreds of gene tags 
can be identified by sequencing only a few clones. Furthermore, the number of times 
a given transcript is identified is a quantitative measurement of that gene's 
abundance in the original population, a feature which facilitates identification of 
differentially expressed genes in different cell populations. 

Some disadvantages of SAGE analysis include the technical difficulty of the 
method, a large amount of accurate sequencing is required, biased towards abundant 
mRNAs, has not been validated in the pharmaco/toxicogenomic setting and has 
only been used to examine well known tissue differences to date. 

Gene Expression Fingerprinting ( GEF ) 

A different capture/restriction digest approach for isolating differentially 
expressed genes has been described by Ivanova and Belyavsky (1995). In this 
method, RNA is converted to cDNA using biotinylated oligo(dT) primers. The 
cDNA population is then digested with a specific endonuclease and captured with 
magnetic streptavidin microbeads to facilitate removal of the unwanted 5' digestion 
products. The use of restricted 3 '-ends alone serves to reduce the complexity of the 
cDNA fragment pool and helps to ensure that each RNA species is represented by 
not more than one restriction product. An adaptor is ligated to facilitate subsequent 
amplification of the captured population. PCR is carried out with one adaptor- 
specific and one biotinylated polydT primer. The reamplified population is 
recaptured and the non-biotinylated strands removed by alkaline dissociation. The 
non-biotinylated strand is then resynthesized using a different adaptor-specific 
primer in thepresence of a radiolabeled dNTP. The labelled immobilized 3'cDNA 
ends are next sequentially treated with a series of different restriction endonucleases 
and the products from each digestion analysed by PAGE. The result is a fingerprint 
composed of a number of ladders (equal to the number of sequential digests used). 
By comparing test versus control fingerprints, it is possible to identify differentially 
expressed products which can then be isolated from the gel and cloned. The 
advantages of this procedure are that it is very robust and reproducible, and the 
authors estimate that 80-93% of cDNA molecules are involved in the final 
fingerprint. The disadvantage is that polyacrylamide gels can rarely resolve more 
than 300-400 bands, which compares poorly to the 1000 or more which are 
estimated to be produced in an average experiment. The use of 2-D gels such as 
those described by Uitterlinden et al. (1989) and Hatada et al. (1991) may help to 
overcome this problem. 

A similar method for displaying restriction endonuclease fragments was later 
described by Prashar and Weissman (1996). However, instead of sequential 
digestion of the immobolized 3'-terminal cDNA fragments, these authors simply 
compared the profiles of the control and treated populations without further 
manipulation. 
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GTAC, 
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CATG- 
GTAC 
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GTAC- 



-AAAA 
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and produce blunt ends 



GGATGCATGXXXXXXXXX 
CCTACGTACXXXXXXXXX 



GGATGCATGOOOOOOOOO 
CCTACGTACOOOOOOOOO 



TE AE 



Tag 



TE AE 



Tag 



| Ligate and amplify 



GGATGCATGXXXXXXXXXOOOOOOOOOCATGCATCC 
CCTACGTACXXXXXXXXXOOOOOOOOOGTACGTAGG 



DiTag 



AE 



AE 



Cleave with AE, isolate diTags, 
concatenate, clone and 
sequence 
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— CATGXXXXXXXXXOOOOOOOOOCATG XXXXXXXXXOOOOOOOOOCATG— 
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Tag! Tag2 



Tag 3 Tag 4 



re 9. Serial analysis of gene expression (SAGE) analysis. cDNA is cleaved with an anchoring enzyme 
(AE)and the 3'ends captured using streptavidin beads. The cDNA pool is divided in half and each 
portion ligated to a different linker, each containing a type IIS restriction site (tagging enzvme, 
TE). Restriction with the type IIS enzyme releases the linker plus a short length of cDNA 
(XXXXX and OOOOO indicate nucleotides of different tags). The two pools of tags are then 
ligated and amplified using linker-specific primers. Following PCR, the products are cleaved with 
the AE and the ditags isolated from the linkers using PAGE. The ditags are then ligated (during 
which process, concatenization occurs) and cloned into a vector of choice for sequencing. After 
Velculescu et al. (1995), with permission. 
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DNA arrays 

'Open * differential display systems are cumbersome in that it takes a great deal 
of time to extract and identify candidate genes and then confirm that they are indeed 
up- or down-regulated in the treated compared to the control tissue. Normally, the 
latter process is carried out using Northern blotting or RT-PCR. Even so, each of 
the aforementioned steps produce a bottleneck to the ultimate goal of rapid'analysis 
of gene expression. These problems will likely be addressed by the development of 
so-called DNA arrays (e.g. Gress et al. 1992, Zhao et al. 1995, Schena etal 1996) 
the introduction of which has signalled the next era in differential gene expression 
analysis DNA arrays consist of a gridded membrane or glass 'chips' containing 
hundreds or thousands of DNA spots, each consisting of multiple copies of part of 
a known gene. The genes are often selected based on previously proven involvement 
in oncogenesis cell cycling, DNA repair, development and other cellular processes 
They are usually chosen to be as specific as possible for each gene and animal species 
Human and mouse arrays are already commercially available and a few companies 
will construct a personalized array to order, for example Clontech Laboratories and 
Research Genetics Inc. The technique is rapid in that hundreds or even thousands 
of genes can be spotted on a single array, and that mRNA /cDNA from the test 
populations can be labelled and used directly as probe. When analysed with 
appropriate hardware and software, arrays offer a rapid and quantitative means to 
assess differences in gene expression between two cell populations. Of course there 
can only be identification and quantitation of those genes which are in the array 
(hence the term 'closed' system). Therefore, one approach to elucidating the 
molecular mechanisms involved in a particular disease/development system may be 
to combine an open and closed system-a DNA array to directly identify and 
quantitate the expression of known genes in mRNA populations, and an open 
system such as SSH to isolate unknown genes which are differentially expressed 

One of the mam advantages of DNA arrays is the huge number of gene fragments 
which can be put on a membrane-some companies have reported gridding up to 
60000 spots on a single glass 'chip' (microscope slide). These high density chip- 
based micro-arrays will probably become available as mass-produced off-the-shelf 
items in the near future. This should facilitate the more rapid determination of 
differential expression in time and dose-response experiments. Aside from their 
high cost and the technical complexities involved in producing and probing DNA 
arrays the main problem which remains, especially with the newer micro-array 
(gene-chip) technologies, is that results are often not wholly reproducible between 
arrays. However, this problem is being addressed and should be resolved within the 
next few years. 



EST databases as a means to identify differentially expressed genes 

Expressed sequence tags (ESTs) are partial sequences of clones obtained from 
cDNA libraries. Even though most ESTs have no formal identity (putative 
identification is the best to be hoped for), they have proven to be a rapid and efficient 
means of discovering new genes and can be used to generate profiles of gene- 
expression m specific cells. Since they were first described by Adams et al (1991) 
there has been a huge explosion in EST production and it is estimated that there are 
now well over a million such sequences in the public domain, representing over half 
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of all human genes (Hillier et al. 1996). This large number of freely available 
sequences (both sequence information and clones are normally available royalty-free 
from the originators) has enabled the development of a new approach towards 
differential gene expression analysis as described by Vasmatzis et al. (1998). The 
approach is simple in theory: EST databases are first searched for genes that have a 
number of related EST sequences from the target tissue of choice, but none or few 
from non-target tissue libraries. Programmes to assist in the assembly of such sets of 
overlapping data may be developed in-house or obtained privately or from the 
internet. For example, the Institute for Genomic Research (TIGR, found at 
http://vww.tigr.org) provides many software tools free of charge to the scientific 
community. Included amongst these is the TIGR assembler (Sutton et aL 1995), a 
tool for the assembly of large sets of overlapping data such as ESTs, bacterial 
artificial chromosomes (BAC)s, or small genomes. Candidate EST clones repre- 
senting different genes are then analysed using RN A blot methods for size and tissue 
specificity and, if required, used as probes to isolate and identify the full length 
cDNA clone for further characterization. In practice however, the method is rather 
more involved, requiring bioinformatic and computer analysis coupled with 
confirmatory molecular studies. Vasmatzis et aL (1998) have described several 
problems in this fledgling approach, such as separating highly homologous 
sequences derived from different genes and an overemphasis of specificity for some 
EST sequences. However, since these problems will largely be addressed by the 
development of more suitable computer algorithms and an increased completeness 
of the EST database, it is likely that this approach to identifying differentially 
expressed genes may enjoy more patronage in the future. 



Problems and potential of differential expression techniques 

The holistic or single cell approach ? 

When working with in vivo models of differential expression, one of the first 
issues to consider must be the presence of multiple cell types in any given specimen. 
For example, a liver sample is likely to contain not only hepatocytes, but also 
(potentially) Ito cells, bile ductule cells, endothelial cells, various immune cells (e.g. 
lymphocytes, macrophages and Kupffer cells) and fibroblasts. Other tissues will 
each have their own distinctive cell populations. Also, in the case of neoplastic tissue, 
there are almost always normal, hyperplastic and/or dysplastic cells present in a 
sample. One must, therefore, be aware that genes obtained from a differential 
display experiment performed on an animal tissue model may not necessarily arise 
exclusively from the intended * target' cells, e.g. hepatocytes /neoplastic cells. If 
appropriate, further analyses using immunohistochemistry, in situ hybridization or 
in situ RT-PCR should be used to confirm which cell types are expressing the 
gene(s) of interest. This problem is probably most acute for those studying the 
differential expression of genes in the development of different cell types, where 
there is a need to examine homologous cell populations. The problem is now being 
addressed at the National Cancer Institute (Bethesda, MD, USA) where new micro- 
disection techniques have been employed to assist in their gene analysis programme, 
the Cancer Genome Anatomy Project (CGAP) (For more information see web site: 
http:/ /vww. ncbi.nlm.nih.gov/ncicgap/intro. html). There are also separation tech- 
niques available that utilise cell-specific antigens as a means to isolate target cells, 
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e.g. fluorescence activated cell sorting (FACS) (Dunbar et al. 1998, Kas-Deelen et 
al. 1998) and magnetic bead technology (Richard et al. 1998, Rogler et al. 1998). 

However, those taking a holistic approach may consider this issue unimportant. 
There is an equally appropriate view that all those genes showing altered expression 
within a compromized tissue should be taken into consideration. After all, since all 
tissues are complex mixes of different, interacting cell types which intimately 
regulate each other's growth and development, it is clear that each cell type could in 
some way contribute (positively or negatively) towards the molecular mechanisms 
which lie behind responses to external stimuli or neoplastic growth. It is perhaps 
then more informative to carry out differential display experiments using in vivo as 
opposed to in vitro models, where uniform populations of identical cells probably 
represent a partial, skewed or even inaccurate picture of the molecular changes that 
occur. 

The incidence and possible implications of inter-individual biological variation 
should be considered in any approach where whole animal models are being used. It 
is clear that individuals (humans and animals) respond in different ways to identical 
stimuli. One of the best characterized examples is the debrisoquine oxidation 
polymorphism, which is mediated by cytochrome CYP2D6 and determines the 
pharmacokinetics of many commonly prescribed drugs (Lennard 1993, Meyer and 
Zanger 1997). The reasons for such differences are varied and complex, but allelic 
variations, regulatory region polymorphisms and even physical and mental health 
can all contribute to observed differences in individual responses. Careful thought 
should, therefore, be given to the specific objectives of the study and to the possible 
value of pooling starting material (tissue/mRNA). The effect of this can be 
beneficial through the ironing out of exaggerated responses and unimportant minor 
fluctuations of (mechanistically) irrelevant genes in individual animals, thus 
providing a clearer overall picture of the general molecular mechanisms of the 
response. However, at the same time such minor variations may be of utmost 
importance in deciding the ability of individual animals to succumb to or resist the 
effects of a given chemical/disease. 



How efficient are differential expression techniques at recovering a high percentage of 
differentially expressed genes ? 

A number of groups have produced experimental data suggesting that mam- 
malian cells produce between 8000-15000 different mRN A species at any one time 
(Mechler and Rabbitts 1981, Hedrick et al. 1984, Bravo 1990), although figures as 
high as 20-30000 have also been quoted (Axel et al. 1976). Hedrick et al. (1984) 
provided evidence suggesting that the majority of these belong to the rare abundance 
class. A breakdown of this abundance distribution is shown in table 1. 

When the results of differential display experiments have been compared with 
data obtained previously using other methods , it is apparent that not all differentially 
expressed mRNAs are represented in the final display. In particular, rare messages 
(which, importantly, often include regulatory proteins) are not easily recovered 
using differential display systems. This is a major shortcoming, as the majority of 
mRN A species exist at levels of less than 0.005 % of the total population (table 1). 
Bertioli et al. (1995) examined the efficiency of DD templates (heterogeneous 
mRNA populations) for recovering rare messages and were unable to detect mRNA 
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species present at less than 1.2% of the total mRNA population— equivalent to an 
intermediate or abundant species. Interestingly, when simple model systems (single 
target only) were used instead of a heterogeneous mRNA population, the same 
primers could detect levels of target mRNA down to 10000X smaller. These results 
are probably best explained by competition for substrates from the many PCR 
products produced in a DD reaction. 

The numbers of differentially expressed mRNAs reported in the literature using 
various model systems provides further evidence that many differentially expressed 
mRNAs are not recovered. For example, DeRisi et aL (1997) used DNA array 
technology to examine gene expression in yeast following exhaustion of sugar in the 
medium, and found that more than 1700 genes showed a change in expression of at 
least 2-fold. In light of such a finding, it would not be unreasonable to suggest that 
of the 8000-15 000 different mRNA species produced by any given mammalian cell, 
up to 1000 or more may show altered expression following chemical stimulation. 
Whilst this may be an extreme figure, it is known that at least 100 genes are 
activated/upregulated in Jurkat (T-) cells following IL-2 stimulation (Ullman et aL 
1990). In addition, Wan et al (1996) estimated that interferon- /-stimulated HeLa 
cells differentially express up to 433 genes (assuming 24000 distinct mRNAs 
expressed by the cells). However, there have been few publications documenting 
anywhere near the recovery of these numbers. For example, in using DD to compare 
normal and regenerating mouse liver, Bauer et aL (1993) found only 70 of 38000 
total bands to be different. Of these, 50% (35 genes) were shown to correspond to 
differentially expressed bands. Chen et aL (1996) reported 10 genes upregulated in 
female rat liver following ethinyl estradiol treatment. McKenzie and Drake (1997) 
identified 14 different gene products whose expression was altered by phorbol 
myristate acetate (PMA, a tumour promoter agent) stimulation of a human 
myelomonocytic cell line. Kilty and Vickers (1997) identified 10 different gene 
products whose expression was upregulated in the peripheral blood leukocytes of 
allergic disease sufferers. Linskens et al. (1995) found 23 genes differentially 
expressed between young and senescent fibroblasts. Techniques other than DD 
have also provided an apparent paucity of differentially expressed genes. Using SH 
for example, Cao et aL (1997) found 15 genes differentially expressed in colorectal 
cancer compared to normal mucosal epithelium. Fitzpatrick et aL (1995) isolated 17 
genes upregulated in rat liver following treatment with the peroxisome proliferator, 
clofibrate; Philips et aL (1990) isolated 12 cDNA clones which were upregulated in 
highly metastatic mammary adenocarcinoma cell lines compared to poorly meta- 
static ones. Prashar and Weissman (1996) used 3' restriction fragment analysis and 
identified approximately 40 genes showing altered expression within 4 h of 
activation of Jurkat T-cells. Groenink and Leegwater (1996) analysed 27 gene 
fragments isolated using SSH of delayed early response phase of liver regeneration 
and found only 12 to be upregulated. 

In the laboratory, SSH was used to isolate up to 70 candidate genes which appear 
to show altered expression in guinea pig liver following short-term treatment with 
the peroxisome proliferator, WY-14,643 (Rockett, Swales, Esdaile and Gibson, 
unpublished observations). However, these findings have still to be confirmed by 
analysis of the extracted tissue mRNA for differential expression of these sequences. 

Whilst the latest differential display technologies are purported to include design 
and experimental modifications to overcome this lack of efficiency (in both the total 
number of differentially expressed genes recovered and the percentage that are true 
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positives), it is still not clear if such adaptations are practically effective — proving 
efficiency by spiking with a known amount of limited numbers of artificial 
construct(s) is one thing, but isolating a high percentage of the rare messages already 
present in an mRNA population is another.- Of course, some models will genuinely 
produce only a small number of differentially expressed genes. In addition, there are 
also technical problems that can reduce efficiency. For example, mRNAs may have 
an unusual primary structure that effectively prevents their amplification by PCR- 
based systems. In addition, it is known that under certain circumstances not all 
mRNAs have 3'polyA sites. For example, during Xenopus development, deadenyl- 
ation is used as a means to stabilize RNAs (Voeltz and Steitz 1998), whilst 
preferential deadenylation may play a role in regulating Hsp70 (and perhaps, 
therefore, other stress protein) expression in Drosophila (Dellavalle et al. 1994). The 
presence of deadenylated mRNAs would clearly reduce the efficiency of systems 
utilizing a polydT reverse transcription step. The efficiency of any system also 
depends on the quality of the starting material. All differential display techniques 
use mRNA as their target material. However, it is difficult to isolate mRNA that is 
completely free of ribosomal RNA. Even if polydT primers are used to prime first 
strand cDNA synthesis, ribosomal RNA is often transcribed to some degree 
(Clontech PCR-Select cDNA Subtraction kit user manual). It has been shown, at 
least in the case of SSH, that a high rRNA:mRNA ratio can lead to inefficient 
subtractive hybridization (Clontech PCR-Select cDNA Subtraction kit user 
manual), and there is no reason to suppose that it will not do likewise in other SH 
approaches. Finally, those techniques that utilise a presubtraction amplification step 
(e.g. RDA) may present a skewed representation since some sequences amplify 
better than others. 

Of course, probably the most important consideration is the temporal factor. It 
is clear that any given differential display experiment can only interrogate a cell at 
one point in time. It may well be that a high percentage of the genes showing altered 
expression at that time are obtained. However, given that disease processes and 
responses to environmental stimuli involve dynamic cascades of signalling, 
regulation, production and action, it is clear that all those genes which are switched 
on/off at different times will not be recovered and, therefore, vital information may 
well be missed. It is, therefore, imperative to obtain as much information about the 
model system beforehand as possible, from which a strategy can be derived for 
targeting specific time points or events that are of particular interest to the 
investigator. One way of getting round this problem of single time point analysis is 
to conduct the experiment over a suitable time course which, of course, adds 
substantially to the amount of work involved. 



How sensitive are differential expression technologies? 

There has been little published data that addresses the issue of how large the 
change in expression must be for it to permit isolation of the gene in question with 
the various differential expression technologies. Although the isolation of genes 
whose expression is changed as little as 1.5-fold has been reported using SSH 
(Groenink and Leegwater 1996), it appears that those demonstrating a change in 
excess of 5-fold are more likely to be picked up. Thus, there is a 'grey zone' 
in between where small changes could fade in and out of isolation between 
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experiments and animals. DD, on the other hand, is not subject to this grey 
zone since, unlike SH approaches, it does not amplify the difference in expression 
between two samples. Wan et al. (1996) reported that differences in expression of 
twofold or more are detectable using DD. 



Resolution and visualization of differential expression products 

It seems highly improbable with current technology that a gel system could be 
developed that is able to resolve all gene species showing altered expression in any 
given test system (be it SH- or DD-based). Polyacrylamide gel electrophoresis 
(PAGE) can resolve size differences down to 0.2%) (Sambrook et al, 1989) and are 
used as standard in DD experiments. Even so, it is clear that a complex series of gene 
products such as those seen in a DD will contain unresolvable components. Thus, 
what appears to be one band in a gel may in fact turn out to be several. Indeed, it has 
been well documented (Mathieu-Daude et al. 1996, Smith et al 1997) that a single 
band extracted from a DD often represents a composite of heterogeneous products, 
and the same has been found for SSH displays in this laboratory (Rockett et al. 
1997). One possible solution was offered by Mathieu-Daude et al. (1996), who 
extracted and reamplified candidate bands from a DD display and used single strand 
conformation polymorphism (SSCP) analysis to confirm which components 
represented the truly differentially expressed product. 

Many scientists often try to avoid the use of PAGE where possible because it is 
technically more demanding than agarose gel electrophoresis (AGE). Unfortunately, 
high resolution agarose gels such as Metaphor (FMC, Lichfield, UK) and AquaPor 
HR (National Diagnostics, Hessle, UK), whilst easier to prepare and manipulate 
than PAGE, can only separate DNA sequences which differ in size by around 
1.5-2% (15-20 base pairs for a 1Kb fragment). Thus, SSH, RDA or other such 
products which differ in size by less than this amount are normally not resolvable. 
However, a simple technique does in fact exist for increasing the resolving power of 
AGE— the inclusion of HA-red (10-phenyl neutral red-PEG ligand) or HA-yellow 
(bisbenzamide-PEG ligand) (Hanse Analytik GmbH, Bremen, Germany) in a 
gel separates identical or closely sized products on base content. Specifically, 
HA-red and -yellow selectively bind to GC and AT DNA motifs, respectively 
(Wawer et al. 1995, Hanse Analytik 1997, personal communication). Since both 
HA-stains possess an overall positive charge, they migrate towards the cathode 
when an electric field is applied. This is in direct opposition to DNA, which 
is negatively charged and, therefore, migrates towards the anode. Thus, if two 
DNA clones are identical in size (as perceived on a standard high resolution 
agarose gel), but differ in AT/GC content, inclusion of a HA-dye in the gel 
will effectively retard the migration of one of the sequences compared to the 
other, effectively making it apparently larger and, thus, providing a means of 
differentiating between the two. The use of HA-red has been shown to resolve 
sequences with an AT variation of less than 1 % (Wawer et al. 1995), whilst Hanse 
Analytik have reported that HA staining is so sensitive that in one case it was used 
to distinguish two 567bp sequences which differed by only a single point mutation 
(Hanse Analytik 1 996, personal communication). Therefore, if one wishes to check 
whether. all the clones produced from a specific band in a differential display 
experiment are derived from the same gene species, a small amount of reamplified 
or digested clone can be run on a standard high resolution gel, and a second aliquot 
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Figure 10. Discrimination of clones of identical/nearly identical size using HA-red. Bands of decreasing 
size (1-5) were extracted from the final display of a suppression subtractive hybridization 
experiment and cloned. Seven colonies were picked at random from each cloned band and their 
inserts amplified using PCR. The products were run on two gels, (A) a high resolution 2 % agarose 
gel, and (B) a high resolution 2% agarose gel containing 1 U/ml HA-red. With few exceptions, all 
the clones from each band appear to be the same size (gel A). However, the presence of HA-red 
(gel B), which separates identically-sized DNA fragments based on the percentage of GC within 
the sequence, clearly indicates the presence of different gene species within each band. For 
example, even though all five re-amplified clones of band 1 appear to be the same size, at least four 
different gene species are represented. 

in a similar gel containing one of the HA-stains. The standard gel should indicate 
any gross size differences, whilst the HA-stained gel should separate otherwise 
unresolvable species (on standard AGE) according to their base content. Geisinger 
et al (1997) reported successful use of this approach for identifying DD-derived 
clones. Figure 10 shows such an experiment carried out in this laboratory on clones 
obtained from a band extracted from an SSH display. 

An alternative approach is to carry out a 2-D analysis of the differential display 
products. In this approach, size-based separation is first carried out in a standard 
agarose gel. The gel slice containing the display is then extracted and incorporated 
in to a HA gel for resolution based on AT/GC content. 

Of course, one should always consider the possibility of there being different 
gene species which are the same size and have the same GC /AT content. However, 
even these species are not unresolvable given some effort— again, one might use 
SSCP, or perhaps a denaturing gradient gel electrophoresis (DG GE) or temperature 
gradient field electrophoresis (TGGE) approach to resolve the contents of a band, 
either directly on the extracted band (Suzuki et al 1991) or on the reamplified 
product. 

The requirement of some differential display techniques to visualize large 
numbers of products (e.g. DD and GEF) can also present a problem in that, in terms 
of numbers, the resolution of PAGE rarely exceeds 300-400 bands. One approach to 
overcoming this might be to use 2-D gels such as those described by Uitterlinden et 
aL (1989) and Hatada et al. (1991). 
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Extraction of differentially expressed bands from a gel can be complex since, in 
some cases (e.g. DD, GEF), the results are visualized by autoradiographic means, 
such that precise overlay of the developed film on the gel must occur if the correct 
band is to be extracted for further analysis. Clearly, a misjudged extraction can 
account for many man-hours lost. This problem , and that of the use of radioisotopes, 
has been addressed by several groups. For example, Lohmann et al. (1995) 
demonstrated that silver staining can be used directly to visualize DD bands in 
horizontal PAGs. An et al. (1996) avoided the use of radioisotopes by transferring a 
small amount (20-30%) of the DNA from their DD to a nylon membrane, and 
visualizing the bands using chemiluminescent staining before going back to extract 
the remaining DNA from the gel. Chen and Peck (1996) went one step further and 
transferred the entire DD to a nylon membrane. The DNA bands were then 
visualized using a digoxigenin (DIG) system (DIG was attached to the polydT 
primers used in the differential display procedure). Differentially expressed bands 
were cut from the membrane and the DNA eluted by washing with PCR buffer prior 
to reamplification. 

One of the advantages of using techniques such as SSH and RD A is that the final 
display can be run on an agarose gel and the bands visualized with simple ethidium 
bromide staining. Whilst this approach can provide acceptable results, overstating 
with SYBR G reen I or SYBR Gold nucleic acid stains (FMC) effectively enhances 
the intensity and sharpness of the bands. This greatly aids in their precise extraction 
and often reveals some faint products that may otherwise be overlooked. Whilst 
differential displays stained with SYBR Green I are better visualized using short 
wavelength UV (254 nm) rather than medium wavelength (306 nm), the shorter 
wavelength is much more DNA damaging. In practice, it takes only a few seconds 
to damage DNA extracted under 254 nm irradiation, effectively preventing 
reamplification and cloning. The best approach is to overstain with SYBR Green I 
and extract bands under a medium wavelength UV transillumination. 

The possible use of 'microfingerprinting ' to reduce complexity 

Given the sheer number of gene products and the possible complexity of each 
band, an alternative approach to rapid characterization may be to use an enhanced 
analysis of a small section of a differential display — a * sub-fingerprint ' or 'micro- 
fingerprint'. In this case, one could concentrate on those bands which only appear 
in a particular chosen size region. Reducing the fingerprint in this way has at least 
two advantages. One is that it should be possible to use different gel types, 
concentrations and run times tailored exactly to that region. Currently, one might 
run products from 100-3000 + bp on the same gel, which leads to compromize in the 
gel system being used and consequently to suboptimal resolution, both in terms of 
size and numbers, and can lead to problems in the accurate excision of individual 
bands. Secondly, it may be possible to enhance resolution by using a 2-D analysis 
using a HA-stain, as described earlier. In summary, if a range of gene product sizes 
is carefully chosen to included certain ' relevant ' genes, the 2-D system standardized, 
and appropriate gene analysis used, it may be possible to develop a method for the 
early and rapid identification of compounds which have similar or widely different 
cellular effects. If the prognosis for exposure to one or more other chemicals which 
display a similar profile is already known, then one could perhaps predict similar 
effects for any new compounds which show a similar micro-fingerprint. 
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An alternative approach to microfingerprinting is to examine altered expression 
in specific families of genes through careful selection of PCR primers and/or post- 
reaction analysis. Stress genes, growth factors and/or their receptors, cell cycling 
genes, cytochromes P450 and regulatory proteins might be considered as candidates 
for analysis in this way. Indeed, some off-the-shelf DNA arrays (e.g. Clontech's 
Atlas cDNA Expression Array series) already anticipated this to some degree by 
grouping together genes involved in different responses e.g. apoptosis, stress, DNA- 
damage response etc. 



Screening 

False positives 

The generation of false positives has been discussed at length amongst the 
differential display community (Liang et al. 1 993 , 1 995, Nishio et al 1994, Sun et al. 
1994, Sompayrac et al. 1995). The reason for false positives varies with the 
technique being used. For instance, in RDA, the use of adaptors which have not 
been HPLC purified can lead to the production of false positives through illegitimate 
ligation events (O'Neill and Sinclair 1997), whilst in DD they can arise through 
PCR artifacts and illegitemate transcription of rRNA. In SH, false positives appear 
to be derived largely from abundant gene species, although some may arise from 
cDNA/mRNA species which do not undergo hybridization for technical reasons. 

A quick screening of putative differentially expressed clones can be carried out 
using a simple dot blot approach, in which labelled first strand probes synthesized 
from tester and driver mRNA are hybridized to an array of said clones (Hedrick et 
al. 1984, Sakaguchi et al. 1986). Differentially expressed clones will hybridize to 
tester probe, but not driver. The disadvantage of this approach is that rare species 
may not generate detectable hybridization signals. One option for those using SSH 
is to screen the clones using a labelled probe generated from the subtracted cDNA 
from which it was derived, and with a probe made from the reverse subtraction 
reaction (ClonTechniques 1997a). Since the SSH method enriches rare sequences, 
it should be possible to confirm the presence of clones representing low abundance 
genes. Despite this quick screening step, there is still the need to go back to the 
original mRNA and confirm the altered expression using a more quantitative 
approach. Although this may be achieved using Northern blots, the sensitivity is 
poor by today's high standards and one must rely on PCR methods for accurate and 
sensitive determinations (see below). 



Sequence analysis 

The majority of differential display procedures produce final products which are 
between 100 and lOOObp in size. However, this may considerably reduce the size of 
the sequence for analysis of the DNA databases. This in turn leads to a reduced 
confidence in the result — several families of genes have members whose DNA 
sequences are almost identical except in a few key stretches, e.g. the cytochrome 
P450 gene superfamily (Nelson et al. 1996). Thus, does the clone identified as being 
almost identical to gene X 0 really come from that gene, or its brother gene X, or its 
as yet undiscovered sister X 2 ? For example, using SSH, part of a gene was isolated, 
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which was up-regulated in the liver of rats exposed to Wy-14,643 and was identified 
by a FASTA search as being transferrin (data not shown). However, transferrin is 
known to be downregulated by hypolipidemic peroxisome proliferators such as Wy- 
14,643 (Hertz et al 1996), and this was confirmed with subsequent RT-PCR 
analysis. This suggests that the gene sequence isolated may belong to a gene which 
is closely related to transferrin, but is regulated by a different mechanism. 

A further problem associated with SH technology is redundancy. In most cases 
before SH is carried out, the cDNA population must first be simplified by restriction 
digestion. This is important for at least two reasons : 

(1) To reduce complexity— long cDNA fragments may form complex networks 
which prevent the formation of appropriate hybrids, especially at the high 
concentrations required for efficient hybridization. 

(2) Cutting the cDNAs into small fragments provides better representation of 
individual genes. This is because genes derived from related but distinct 
members of gene families often have similar coding sequences that may cross- 
hybridize and be eliminated during the subtraction procedure (Ko 1990). 
Furthermore, different fragments from the same cDN A may differ considerably 
in terms of hybridization and amplification and, thus, may not efficiently do one 
or the other (Wang and Brown 1991). Thus, some fragments from differentially 
expressed cDNAs may be eliminated during subtractive hybridization pro- 
cedures. However, other fragments may be enriched and isolated. As a 
consequence of this, some genes will be cut one or more times, giving rise to two 
or more fragments of different sizes. If those same genes are differentially 
expressed, then two or more of the different size fragments may come through 
as separate bands on the final differential display, increasing the observed 
redundancy and increasing the number of redundant sequencing reactions. 

Sequence comparisons also throw up another important point — at what degree 
of sequence similarity does one accept a result. Is 90% identitiy between a gene 
derived from your model species and another acceptably close? Is 95% between 
your sequence and one from the same species also acceptable? This problem is 
particularly relevant when the forward and reverse sequence comparisons give 
similar sequences with completely different gene species! An arbitrary decision 
seems to be to allocate genes that are definite (95% and above similarity) and then 
group those between 60 and 95% as being related or possible homologues. 

Quantitative analysis 

At some point, one must give consideration to the quantitative analysis of the 
candidate genes, either as a means of confirming that they are truly differentially 
expressed, or in order to establish just what the differences are. Northern blot 
analysis is a popular approach as it is relatively easy and quick to perform. However, 
the major drawback with Northern blots is that they are often not sensitive enough 
to detect rare sequences. Since the majority of messages expressed in a cell are of low 
abundance (see table 1), this is a major problem. Consequently, RT-PCR may be the 
method of choice for confirming differential expression. Although the procedure is 
somewhat more complex than Northern analysis, requiring synthesis of primers and 
optimization of reaction conditions for each gene species, it is now possible to set up 
high throughput PCR systems using mulitchannel pipettes, 96 +-well plates and 
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appropriate thermal cycling technology. Whilst quantitative analysis is more 
desrrable, bemg more accurate and without reliance on an internal standard Z 
money and „me needed to develop a competitor molecule 
especially when one m.ght be examining tens or even hundreds of gen. specTes The' 

must firs, of all choose an mtern.l standard that does no, change in the tes, ceUs 
compared ,o ft. controls. Numerous reference genes have been ,Hed in , he p , for 
example ■n.erferon-gamrna (IFN-y. Frye e, al. 1 989), /J-actin (Heuvd 7af 1994, 
glyceraldehyde ; 3- P hosphat. dehydrogenase (GAPd£ Wong « j 1W4) d ' 
hydrofolate reductase DHFR, Mohler and Butler 1991) 8.2 li" ,' k r 

7 Zst::: ,9 ?' hy r xan,hi - £ ■ h? R t Fo'st ; 

al. 1998) and a number of others (ClonTechniaues 1997M th. h IKL \ rosset 
standard should not change its level of ~ P r^^ T ^s of cXf 
stage ,n the cell cycle or through the effects of external stimuli. However h ha been 
shown on numerous occasions that the levels of most housekeeping gen^ current v 
used by the research community do in fact change under certam Ln&lTZT 
different tissues (ClonTechniques 1997b). It if impe^ 
iminary experiments be carried out on a panel of housekeeping ge „ es to estlwLh 
their suitability for use in the model system. S estaoiisn 

Interpretation of quantitative data must also be treated with caution Bv 
comparing the lists of genes identified by differential expression one can Per haos 

nTmif Crem SPedeS r6aCt " <*™ ^ » ~^m h r 
ror example, rats and mice appear sensitive to the non-genotoxic effects of a wide 

range of peroxisome proliferates whilst Syrian hamsters and guinea pLs are la"! 

"si?:;™ v 2) T'- Ro rfH ks and Turnbui1 m7 > Lake « ^ ^89 a i9 g 

Makowska et al 1992). A simplified approach to resolving the reason(s) whv is to 
compare lists of up- and down-regulated genes in order to identify tho se whkh a e 
expressed in only one species and, through background knowledge of t he effects of 
the said gene, might suggest a mechanism of facilitated non-genotoxic carcin^enLi, 
or protection. Of course, the situation is likely to be far more cSp^STJTS 
there were one key gene protecting guinea pig from non-genotoxic effects a nd Tit was 
upregulated 50 times by PPs, the same gene might only be up-regulated five time 
» the rat. However, since both were noted to beu pn8 L^cZpo^T^ 
gene may be overlooked. Just to complicate matters a larg c^^^ 
does not necessarily mean a biologically important change. For example what "he 
rue relevance of gene Y which shows a 50-fold increase after a particular r « 
and gene Z which shows only a 5-fold increase ? If one examines the lite raturT one 
may find that historically, gene Y has often been shown to be up-regu . "d 4^0 
fold by a number of unrelated stimuli-in light of this the 50-fold increase wouH 
appear less significant. However, the literature may show that gene Z ha ^be „ 

Lc7el alTthe aVm8 ^ d ° Ubled " "P"which makes your -fo d 

increase all the more exciting. Perhaps even more interesting is ,f that same 5- o d 
increase as only been seen in related neoplasms or followmgtreatment wiTr la d 



Problems in using the differential display approach 

Differential display technology originally held promise of an easily obtainable 
fingerprint of those genes which are up- or down-regulated in test anills}cell s t 
a developmental process or following exposure to given stimuli. However it h " 
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become clear that the fingerprinting process, whilst still valid, is much too complex 
to be represented by a single technique profile. This is because all differential dSy 
techniques have common and/or unique technical problems which preclude the 
isolate and identification of all those genes which show changes in expression 
Furthermore, there are important genetic changes related to disease develpmem 
which different,.! expression analysis is simply not designed to address. An exampk 
of this is the presence of small deletions, insertions, or point mutations such as those 

morohTsms Pol "T™'- tUm ° Ur SUPPr6SS ° r and -dividual po"! 

morphisms. Polymorphic vanations, small though they usually are, are often 
regarded as being of paramount importance in explaining why^ some pat!en« 
respond better than others to certain drug treatments (and, in logical extension why 

rers^thV'd ,f ^ xenobioi/c.rdno^iS 
others). The identification of such point mutations and naturally occurring 
polymorphisms requires the subsequent application of sequencing, SSCP DGGE 
or TGGE to the gene of mterest. Furthermore, differential display is not desiWd 
to address issues such as alternatively spliced gene species or whether an inc reaped 
abundance of mRNA is a result of mcreased transcription or increased mRNA 



Conclusions 

Perhaps the main advantage of open system differential display techniques is that 
they are no hmited by extant theories or researcher bias in revealing genes which are 
differentially expressed, since they are designed to amplify " genes which 
demonstrate altered expression. This means that they are useful for the iso atTon of 
piously unknown genes which may turn out be useful biomarkers of a par cular 
s ate or condition At least one open system (SAGE) is also qu.ntit.5^™ 
eliminating the need to return to the ori gl nal mRNA and carry out Northern^CR 
analysis to confirm the result. However, the ra pi d progress of genome mappmg 
projects means that over the next 5-10 years or so, the balance of experimental use 
will switch from open to closed differential display systems, particular y DNA 
arrays Arrays are easier and faster to prepare and use, provide quantitative data are 
suitable for high throughput analysis and can be tailored to look at specific ignalC 
pathways or families of genes. Identification of all the gene sequences in human and 
common laboratory animals combined with improved DNA array techn 0 "ow 
means that it will soon no longer be necessary to try to isolate different.Lly x pr essed 
genes using the technically more demanding open system approach. Thus the" 
mam advantage (that of identifying unknown genes) will be largely eradicated It 
ikely, therefore that their sphere of application will be reduced to analysi of th 
less common laboratory species, since it will be some time yet before the genomes 0 f 

ZtZZlT rafiSh> deCtriC g6rbilS ' C " yfish and ^ A* eximpE w£ 
Of course, in the end the question will a ] ways remain: What is the functional/ 
biological significance of the identified, differentially expressed genes' One 
persistent problem is understanding whether differentially expressed genes are a 
cause or consequence of the altered state. Furthermore, many chemicals, such a 
non-genotoxic carcinogens, are also mitogens and so genes associated with 
replication will also be upregulated but may have little or nothing to do with the 
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carcinogenic effect. Whilst differential display technology cannot hope to answer 
these questions, it does provide a springboard from which identification, regulatory 
and functional studies can be launched. Understanding the molecular mechanism of 
cellular responses is almost impossible without knowing the regulation and function 
of those genes and their condition (e.g. mutated). In an abstract sense, differential 
display can be likened to a still photograph, showing details of a fixed moment in 
time. Consider the Historian who knows the outcome of a battle and the placement 
and condition of the troops before the battle commenced, but is asked to try and 
deduce how the battle progressed and why it ended as it did from a few still 
photographs— an impossible task. In order to understand the battle, the Historian 
must find out the capabilities and motivation of the soldiers and their commanding 
officers, what the orders were and whether they were obeyed. He must examine the 
terrain, the remains of the battle and consider the effects the prevailing weather 
conditions exerted. Likewise, if mechanistic answers are to be forthcoming, the 
scientist must use differential display in combination with other techniques, such as 
knockout technology, the analysis of cell signalling pathways, mutation analysis and 
time and dose response analyses. Although this review has emphasized the 
importance of differential gene profiling, it should not be considered in isolation and 
the full impact of this approach will be strengthened if used in combination with 
functional genomics and proteomics (2-dimensional protein gels from isoelectric 
focusing and subsequent SDS electrophoresis and virtual 2D-maps using capillary 
electrophoresis). Proteomics is attracting much recent attention as many of the 
changes resulting in differential gene expression do not involve changes in mRNA 
levels, as decribed extensively herein, but rather protein-protein, protein-DNA and 
protein phosphorylation events which would require functional genomics or 
proteomic technologies for investigation. 

Despite the limitations of differential display technology, it is clear that many 
potential applications and benefits can be obtained from characterizing the genetic 
changes that occur in a cell during normal and disease development and in response 
to chemical or biological insult. In light of functional data, such profiling will 
provide a 'fingerprint' of each stage of development or response, and in the long 
term should help in the elucidation of specific and sensitive biomarkers for different 
types of chemical /biological exposure and disease states. The potential medical and 
therapeutic benefits of understanding such molecular changes are almost im- 
measurable. Amongst other things, such fingerprints could indicate the family or 
even specific type of chemical an individual has been exposed to plus the length 
and/or acuteness of that exposure, thus indicating the most prudent treatment. 
They may also help uncover differences in histologically identical cancers, provide 
diagnostic tests for the earliest stages of neoplasia and, again, perhaps indicate the 
most efficacious treatment. 

The Human Genome Project will be completed early in the next century and the 
DNA sequence of all the human genes will be known. The continuing development 
and evolution of differential gene expression technology will ensure that this 
knowledge contributes fully to the understanding of human disease processes. 

Acknowledgements 

We acknowledge Drs Nick Plant (University of Surrey), Sally Darney and Chris 
Luft (US EPA at RTP) for their critical analysis of the manuscript prior to 
submission. This manuscript has been reviewed in accordance with the policy of the 



Differential gene expression 537 

US Environmental Protection Agency and approved for publication. Approval does 
not signify that the contents reflect the views and policies of the Agency, nor does 
mention of trade names constitute endorsement or recommendation for use. 

References 

YE™' fc'SSi cfT B " M °Z' R " F - KERLAVAGE - a - r - Mc c—« w- R "d 

VENTOR , j.C., 1991, Complementary DNA sequencing: expressed sequence tags and human 

genome project. Science, 252, 1651-1656 numan 
AN, G., Luo G-. Veltr, , R W and O'Hara, S. M., 1996, Sensitive non-radioactive differentia! display 

method using chemiluminescent detection. Biotechniques, 20 342-346 merennal flls P la J 

Axel, R FE.GELSON , and Schultz , G., 1976, Analysis of the complexity and diversity of mRNA from 

chicken liver and oviduct. Cell, 7 247-254 

BAN \pitnel^ tn, i! S ° f n ° rmal and """or-d.ri.ed human mammarv 

epitneiiai ceils expressed in a medium that supports long-term crowth of hnth ,..11 „ ' 

Proceedings of the Naional Academy of Sciences, USA 86 1249^253 * P 

Bauer D., Muuer , H [Reich , J., Reoel , H., Ahrenkiel , V.,' Warthoe, P. and Strauss M 1993 

^^^et^^^^ » - ~ 

^7 f^TI^^ R - STE,NB,SS • H.-H. and Antonfw , 

" ^ ' ~« bi8S t0W " ds «W number 

BRAW, ^!°1, 37^6 indU " d dUfing G0/G1 tranSiti ° n » m0USC fibroblas «- ^ ,„ Cancer 
Burn, T. C, Petrovick , M.S., Hohaus, S., Rollins B T and Tenfn n r 100. 

Cao, J., Cai, X Zheng, L., Ceng, L., Shi, Z., Pad, C. C. and Zheng, S., 1997 Characterisation of 

ANG no G m !I ^fiL^^'iT- M 'x 1998 ' Muhiple chan * e ' s m S ene expression are associated with 
Chen ?7c»^v\ ™ 0dulat % n ° f *f neo P las «' Phenotype. Conc/jWcA, SS.^SSJS 
' whLeTprSion t'.S^f.S T '- A " ™° R ™ ' h S - 3nd YaCER ' J " D " 1 " 6 - Identification of g nes 
S^cSSSu" iS'S m "° SUPP — * of ethinyl estradiol-treated female 

CHEN ' Ln!^ 7& PECK> /I" 'J 96 ' Non - radi ° a «ive differential display method to directly visualise and 
amplify differential bands on nylon membrane. Nucleic Acid Research 24 793-794 

CLONTECHNiQUES 1997a, PCR-Select Differential Screening Kit-the nextstep after Clontech PGR- 
Select cDNA subtraction. ClonTechniques, XII, 18-19 woniecn r^K- 

Clon Techniques , 1997b. Housekeeping RT-PCR amplimers and cDNA probes. ClonTechniaues, XII, 

Dams, M. M*., Cohen D I., n.elsen , E. A., Ste>nmetz , M., Paul, W. E. and Hood L 1984 Cell 
type-specific cDNA probes and the murine I region: the localization and oriemadon of Ad abha" 
Proceedings of the National Academy of Sciences (USA) 81 2194-2198 ° n 0t Ad alpha - 

Dellavalle , R P., Peterson , R. and L^dquist , S., 1994, Preferential deadenvlation of HSP70 mRNA 

Soi, H, r 3 0 64«6f9 U,atln8 HSP?0 eXPreSS1 ° n ln Dr ° SOphlla MTlZ^Cell 
DeR.s. , J. L., Vashwanath , R. L. and Brown, P., 1997, Explonng the metabolic and genetic control of 
gene expression on a genomic scale. Science, 278, 680-686 

Tcii Y N F - ^n^p • n P> C ^ CH ' K ' A - M ° QADAM • F - Huanc. B., Lukyahov , 
jv., uurskaya, i\. t Sverdlov, E. D. and Siebert P n iqqa c, . . 

I a ct ; * Transcriptional activation of cytochrome P4S0 

S" 0 /^ 2?, r i-9. ClaSSCS ° f ChCmiCal lndUCCrS ' ClM °» d E *»™<»<°< P^aZTogy 'I'd 
Duguid , J. R. and Dinauer M. C, 1990, Library subtraction of in vitro cDNA libraries to identifv 

uunbar, P R., Ogg, G. S Chen J., Rust, N., van der Bruggen , P. and Cerundolo , V 1998 Direct 
solation, phenotyping and cloning of low-frequency antigen-specific cvtotoxk T'lymphocv" 
from peripheral blood. Current Biology, 26, 413-416. ' 'ympnocytes 



J- C. Rockett ct al. 



Fitzpatwck ,D. R., Germain -Lee. E and Valle n iooc t i • 

fingerprinting of high density cDNA filte'r , ^"■u An^""" ' H " 1992 ' Hybridisation 
Mammalian Genome, 3, 609^19 C ° NA P °° ls derived from whole 

Griffin , G. and Krishna S 1Q0R r,-«u' • • r 

Physicians, London, 32,' 195-198 " ^ Joumal °f the *W* College of 

Groenink , M. and Leegwater A C T iooa t i - * . 

Cloning of Jurkat cell transcripts tL^tZl^T^ « reaction 
Acetate. Analytical Biochemistry 2A0 90-9/ ph > tohema 8 lut ™ »d phorbol 12-Myrystate 13- 

Hampson , I. N. and Hampson , L., 1997* CCLS anrl nnr>o u 

News (A publication of Am„sh am Lif f Scfen«) 23!?^°" d ° nin8 Sci ™ 

Hampson , I. N., Hampson , L. and Dexter TM oofn , 

(DROP) global amplification of cDNA •' ii"' n !£ ? ireCt,ona ' random oligonucleotide primed 
Acids Research. 24, 4832-4835 ^plication to subtracts cDNA cloning. N« C /eic 

Hampson , I. N., Pope L Cowi iwr r t j „ 
(CCLS):'anewt«h 0 Tfor;h G eg J e^ 

flwearc*, 20, 2899 generation of subtract: ve hybridisation probes. Nucleic Acids 

SS£ s 'SI 0 r "£™ ! /;' 1 7™ «» dte,^. 20 , 5SS . S61 

S»ck B »ch, M..Z.k,n,M.M.«„<i'bai.'.Tan« I HQs t , 
^"»°6i^ K ^-^^^-. O"- • W. F, L„c„ , G. W. ,„ d 

DuBuque, T., Fa^lo a ^.s , w hZ. J" £ H1APELL1 1 B - Ch,ssoe • S - D =™» . N 
M., Le, N., Mardis E 'S ^Io»k m p Mm HULTMAN ' M - KuCABA ' T - Lacy . M. Le 



Differential gene expression 



689 



quantifying cytomegalic endothelial cells in peripheral blood from cytomegalovirus-infected 

patients. Clinical Diagnostic and Laboratory Immunobgy, 5, 622-626. 
Kilty , I. and Vickers , P., 1997, Fractionating DNA fragments generated by differential display PCR. 

Strategies Newsletter (Stratagene), 10, 50-51. 
Kletnjan , D.-J. and van Heyningen , V., 1998, Position effect in human genetic disease. Human and 

Molecular Genetics, 7, 1611-1618. 
Ko, M. S., 1990, An *equalized cDNA library* by the reassociation of short double-stranded cDNAs. 

Nucleic Acids Research, 18, 5705-5711. 
Lake, B. G., Evans, J. G., Cunninghame , M. E. and Price, R. J., 1993, Comparison of the hepatic 

effects of Wy-14,643 on peroxisome proliferation and cell replication in the rat and Syrian 

hamster. Environmental Health Perspectives, 101, 241-248. 
Lake, B. G., Evans , J. G., Gray, T.J. B., Korosi , S. A. and North , C. J., 1989, Comparative studies 

of nafenopin-induced hepatic peroxisome proliferation in the rat, Syrian hamster, guiea pig and 

marmoset. Toxicology and Applied Pharmacology, 99, 148-160. 
Lennard, M.S., 1993, Genetically determined adverse drug reactions involving metabolism. Drug 

Safety, 9, 60-77. 

Levy, S., Todd, S. C. and Maecker, H. T., 1998, CD81(TAPA-1): a molecule involved in signal 

transduction and cell adhesion in the immune system. Annual Review of Immunology, 16, 89-109. 
Liang, P. and Pardee, A. B., 1992, Differential display of eukaryotic messenger RNA by means of the 

polymerase chain reaction. Science, 257, 967-971. 
Liang, P., Averboukh , L., Keyomarsi , K., Sager, R. and Pardee, A., 1992, Differential display and 

cloning of messenger RNAs from human breast cancer versus mammary epithelial cells. Cancer 

Research, 52, 6966-6968. 

Liang, P., Averboukh , L. and Pardee, A. B., 1993, Distribution & cloning of eukaryotic mRNAs by 

means of differential display refinements and optimisation. Nucleic Acids Research, 21 , 3269-3275. 
Liang, P., Bauer, D., Averboukh , L., Warthoe, P., Rohrwild , M. f Muller, H., Strauss, M. and 

Pardee, A. B., 1995, Analysis of altered gene expression by differential display. Methods in 

Enzymology, 254, 304-321. 
Linskens , M. H., Feng, J., Andrews, W. H., Enlow , B. E., Saati, S. M., Tonkin , L. A., Funk, 

W. D. and Villeponteau , B., 1995, Cataloging altered gene expression in young and senescent 

cells using enhanced differential display. Nucleic Acids Research, 23, 3244-3251. 
Lisitsyn , N., Lisiitsyn , N. and Wigler , M., 1993, Cloning the differences between two complex 

genomes. Science, 259, 946-951. 
Lohmann , J., Schickle , H. and Bosch, T. C. G., 1995, REN Display, a rapid and efficient method for 

non-radioactive differential display and mRNA isolation. Biotechniques, 18, 200-202. 
Lunney , J. K., 1998, Cytokines orchestrating the immune response. Reviews in Science and Techoloev 

17, 84-94. 

Makowska, J.M., Gibson, G. G. and Bonner, F.W., 1992, Species differences in ciprofibrate- 
induction of hepaic cytochrome P4504A1 and peroxisome proliferation. Journal of Biochemical 
Toxicology,!, 183-191. 

Maldarelli , F., Xiang, C, Chamoun , G. and Zhchner , S. L., 1998, The expression of the essential 
nuclear splicing factor SC35 is altered by human immunodeficiency virus infection. Virus 
Research, S3, 39-51. 

Mathieu -Daude, F., Cheng , R., Welsh , J. and McClelland , M., 1996, Screening of differentially 
amplified cDNA products from RNA arbitrarily primed PCR fingerprints using single strand 
conformation polymorphism (SSCP) gels. Nucleic Acids Research, 24, 1504-1507. 

McKenzie, D. and Drake, D., 1997, Identification of differentially expressed gene products with the 
castaway system. Strategies Newsletter (Stratagene), 10,19-20. 

McClelland, M., Mathieu .Daude, F. and Welsh, J m 1996, RNA fingerprinting and differential 
display using arbitrarily primed PCR. Trends in Genetics, 11, 242-246. 

Mechler, B. and Rabbitr , T. H., 1981, Membrane-bound ribosomes of myeloma cells. IV. mRNA 
complexity of free and membrane-bound polysomes. Journal of Cell Biology, 88, 29-36. 

Meyer, U. A. and Zanger, U. M., 1997, Molecular mechanisms of genetic polymorphisms of drug 
metabolism. Annual Review of Pharmacology and Toxicology, 37, 269-296. 

Mohler , K. M. and Butler, L. D., 1991, Quantitation of cytokine mRNA levels utilizing the reverse 
transcriptase -polymerase chain reaction following primary antigen-specific sensitization in 
vivo — I. Verification of linearity, reproducibility and specificity. Molecular Immunology 28 
437-447. 

Murphy, L. D., Herzog, C. E., Rudick, J.B., Tito Fojo, A. and Bates, S. E., 1990, Use of the 
polymerase chain reaction in the quantitation of the mdr-1 gene expression. Biochemistry, 29 
10351-10356. ' 

Nelson, D. R., Koymans , L., Kamataki , T., Steceman , J. J., Feyereisen , R., Waxman, D.J., 
Waterman , M. R., Gotoh , O., Coon, M. J., Estabtrook , R. W., Gunsalus , I. C. and Nebert , 
D. W., 1996, Update on new sequences, gene mapping, accession numbers and nomenclature. 
Pharmacogenetics, 6, 1-42. 



690 



J. C. Rockett et aj. 



Nishio , Y., Aiello ,L. P. and King G I 1Q<U ri„ • a j 

«ihJ. mI^T, b > "I»~»«k«I d fferai « 

n, e u !t .ticpo, t „,i, in m ml m^Jl „ ' '°' I" 1 ""!" of 8me! ""CMMd with high 
«2. 199-203. ">™"">y s^nocremon,.,. Jw o/rt, MHiomrf Can*, lnaij,, 

Prashar, Y. and Weissman S M iooa a 1 r j «- 

restriction fragments' of cDN As JwT' of , d f er < nQal expression by display of 3'end 
659-663. ° NAS - Proc <«W* o/ the National Academy of Sciences (USA), 93, 

Ragno, S., Estrada, I., Butler, R. and Colston m t 100-7 o , • 

oJE,p^^ B li£ ' 3 S 6 , 43W Cy "' Cl " 0me P4S0 -' n overvi «"- '«"» 

Richard, L., VaAsco, P. and Detmar m' iqoc a\- i ■ 

isolation and long-term S ofhi™ "7 leimmunoma 8"«c protocol for the selective 
Cell Research, 240, 1-6. " dem,al microvasc «l« endothelial cells. Experimental 

Rockett, J. C, Esoaile , D.J. and Gibson r <~ 100-7 1 

hepatocarcinogenesis using dffercnSdiLv"' ' ^ Pr ° filin8 ° f "°"-8enotoxic 
(ddRT-PCR). £ ur0/(efl „ Jol^r o }D^ £lw VeTSe ^"P^n-polymerase chain reaction 

Rodricks , J. V. and Turmull D 1 987 Tme^mp Pharmacokinetics, 22, 329-333. 

proliferate. Toxico/o^^^ ln P«°™°mes and peroxisome 

Rons S/i^^SS? CharaCtt — ° n * 

Rudin, C. M. and Thompson C B iqqr r j i 

25, 435-446. ' ' B " Cdl devel °P™nt and maturation. Semmar, A, CWo^y, 

Sakaguchi, N., Berger, C. N and Melc-hfb<: it mot t 1 • ~ 

«„, r d i„ pre.B ce^ISokw 5" ™»V CDNA ™" <"'" RNA ««« 

^^SEtf O- of DNA. ,„ N. Po rd . M . 

"^^•/s^X^^i^r"' ° f 1000 8ene! * <*« 

Sch»bdo> -MAU h .OU»»,S.,GlUl>OI-HEtINSmiI P , nt lCHAI.>..v P 100« u 

SEMENZA^r^^Tra^ 8 ^ 

W^M^iS^K^ST 1 re8U,ati0n ° f 8Cne eXpr " S10n : m " ha -- s P3tho P h y sio.o gy . 

SEWAL Luc^" G W D l A 95 CL , A n R d K ' G H °- TR,TSCHER ' A - M - TULLY • D - B - v *n«h Heuvel J and 
C2, ?«Ti09?i834 1 ^ 8Cne transC "P tion: -plications for biomarkers. CttSj 

Singh , N., Agrawal, S. and Rastogi A ^ 1 00-7 if . 

to major ^no^i^^^'J^^^ d '" ases anc 'immunity: special reference 

Smith W P i, a a.T " uu,t y c ' ° m P Emerging Infectious Diseases, 3, 41-49. 

Smith N. R., Li, A., Aldersley , M., High , A. S., Markham A. F and Robinson PA 1007 p -a 
determination of the comnlexirv nf r riMis k j , , jmjbinson , r. a., 1997, Rapid 

gel, ^Xc.d, Resect ^ 3552-^554 ^ ^ DDRT " PCR P°ly«rylamide 

Sompayrac , L., Jane, S. ( Burn T C Tpwcw r. r> j r> , 



Differential -gene expression 



691 



Suno, Y. J. and Denman R. B 1997, Use f two reverse transcriptases eliminates false-positive results 
m differential display. Biotechniques, 23, 462-464 P°»nve results 

Sutton, G., White, 0 Adams, M. and Kerlavage, A., 1995, TIGR Assembler; A new tool for 
assembling large shotgun sequencing projects. Genome Science and Technology 1 9-1 9 

Suzuk,, Y. SEKnrA , T and Hayash, , K., 1991, Allele-specific polymerase chain reaped n: a meth d f r 
amphncati n and sequence determination of a single component among a mixture f seouence 
variants. Analytical Biochemistry, 192, 82-84 •»««!« i sequence 

Syed, V., Gu W. and Hecht N. B., 1997, Sertoli cells in culture and mRNA differential display provide 
xSX C , 5 26™™* 8SSay SySt£m t0 deWCt Chan8 " indU " d by *™oio«c?. JouZSf 

"^n^nntin^VK^ 0800 ";- ? -'J ^T' D " L " and VUGL • J ' 1989 > Two-dimensional DNA 
2742^746 " indlVlduals - Proc ^ings of the National Academy of Sciences (USA), 86, 

ULLMA r r ;m rlfeT^mT" ; L VERWHJ ' C L V 3nd CRABTREE ' G " R - 1990 ' Transmission of signals 
from the T lymphocyte antigen receptor to the genes responsible for cell proliferation and immune 
function: the missing link. Annual Review of Immunology, 8 421-452 

Vasmatzis , G Essand, M , BR.NKMANN , U., Lee, B. and Paston I., 1998', Discovery of three genes 

V "J 0n h*Z%Ws™^ • " an ^' Kinzler , K. W., 1995, Serial analysis of gene 
Voeltz G. K. and Stettz , J. A. 1998, AuuuA sequences direct mRNA deadenylation uncoupled from 
decay during Xenopus early development. Molecular and Cell Biology, 18, 7537-7545 

^lUl. 3 ' K> W " 1993 ' ThC mUltiStCP n8tUre ° f Ca "« r - T ^ ^ Genetics, 9, 

Walter , J Beuteld , M. Hampson I. and Read, C, 1997, A novel approach for generating subtractive 
„, fo b o f ° r d,, I er T cntial screening by CCLS. Life Science News, 21 ,13-14 ""tractive 
Wan.J S., Sharp S.J Po,r«er,G.M.-C., Wagaman, P. C., Chambers, J., Pyat. J Hom Y -L 

n oHff E - "n™' A " PETCRS0N ' R A - J ACKSON ' M.R. and Erlander.M.S 1996' 
Cloning differentially expressed mRNAs. Nature Biotechnology, 14, 1685-1691 
Walter , J Belfield M. Hampson , I. and Read, C, 1997, A novel approach for generating subtractive 
probes for differential screening by CCLS, Life Science News 21, 13-14 *uDtractive 

WAN % z c^ d , SSS S; n5o 9 5-ii A or ne expression screen - p ™ edin8 > « th < * * 

Wawer, C J^hberc, H., Meyer, G. and Muyzer, G., 1995, A simple and rapid electrophoresis 
mT^^ST SeqUen " V3riati0n " PCR ' am P" fied DNA { "*™™- NJric Acids Research, 
Welsh , j., Chada K., Dalal S. S Cheng , R., Ralph, D. and McClelland , M., 1992 Arbitrarily 
i» P £ m * finBerpnnting of RNA. Nucleic Acids Research, 20, 4965-4970 Arbltranl y 

Wong, H., Anderson , W D., Cheng, T. and Rubowol , K. T., 1994, Monitoring mRNA expression 
^polymerase chain reaction: the 'primer-dropping' method. Analytical BiochL^ry S. 

Wong K. K. and McClelland M., 1994, Stress-inducible gene of Salmonella typhimurium identified 
by arbitrarily pnmed PCR of RNA. Proceedings of the National Academy of Sciences^ SA^ 91, 

w ™:^ anti - oncogenes ; the molecular basis of — 

Xhu, D Xhan, W. L . Leung . B. P.. Huang, F P., Wheeler , R„ Pedrahta , D., Rob.nson . J. H. and 
, L f ' fp Y V. 1 998> Se ec VX C c *P resS10 " °f * "able cell surface molecule on type 2 but not type 1 
^t\otrT ct\]s. Journal of Experimental Medicine, lil 787-794 

YANG ' *L",? SY ™* Kt .' A D f' L 1996 ' Clonin 8 differentially expressed genes by linker capture 
subtraction. Analytical Biochemistry, 237, 109-114 « P njre 

Zhao.N Hashida, H Takahashi , N., M.sum. , Y. and Sakak., Y., 1995, High-densitv cDNA filter 
analysis: a novel approach for large scale quantitative analysis of gene expression. Gene, 156, 

Zhao, X j., Newsome J. T. and C.hlar , R. L., 1 998, Up-regulation of two Candida albicans genes in the 
rat model of oral candidiasis detected by differential display. Microbial Pathogenesis, 25 121-129 

Zimmermann C.R.. Orr W C, Leclerc, R. F., Barnard, C. and T.mberlake , W.E 1980 
Molecular cloning and selection of genes regulated in Aspergillus development. Cell, 21, 709-715! 



Exhibit D of Rockett Declaration 
* with Response dated 1 1/18/03 - 

In USSN: 09/867,958 j 
PC-0041 CIP ; 
EUROPEAN JOURNAL OF VoL 22. No. 4. pp. 329-333 



Molecular profiling of non-genotoxic hepato- 
carcinogenesis using differential display reverse 
transcription-polymerase chain reaction (ddRT-PCR) 

J.C. ROCKETT 1 , D J. ESDAILE 2 and G.G. GIBSON 1 

1 Molecular Toxicology Group, School of Biological Sciences, University of Surrey, Guildford, UK 
2 Rh6ne~Poulenc Agrochemicals, Sophia Antipolis, France 

Keywords : ddRT-PCR, non-genotoxic hepatocarcinogenesis, phenobarbital, rat, WY- 14,643 



SUMMARY 



The technique of differential display reverse transcription-polymerase chain reaction (ddRT-PCR) has been used to produce unique 
profiles of up-regulated and down-regulated gene expression in the liver of male Wistar rats following short term exposure to the 
non-genotoxic hepatocarcinogens, phenobarbital and WY-14,643. Animals were treated for 3 days, whereupon their livers were 
extracted and snap frozen. mRNA was prepared from the livers and used for ddRT-PCR. Individual bands from the differential 
displays were extracted and cloned. False positives were eliminated by dotblot screening and true positives then sequenced and 
identified. 



INTRODUCTION 

Safety evaluation of v new chemicals usually necessi- 
tates the examination of genotoxic and carcinogenic 
potential using short-term in vitro and in vivo geno- 
toxicity assays augmented by chronic bioassay tests. 
The short-term assays have proved useful in the early 
identification of potential genotoxic carcinogens, but 
their value is limited by observations which suggest 
that approximately 60% of chemicals identified as car- 
cinogens in life-exposure studies produce mainly 
negative findings in short-term genotoxicity tests (1,2). 
Thus, there is currently no reliable and rapid means of 
evaluating the carcinogenic risk of new chemicals 
which fall into this latter group of compounds, termed 
non-genotoxic (or epigenetic) carcinogens. 



Please send reprint requests to : Dr John Rockett, Molecular 
Toxicology Group, School of Biological Sciences, University 
of Surrey, Guildford, Surrey GU2 5XH, UK. 



It is now evident that non-genotoxic carcinogens 
constitute a group of chemicals which are not only di- 
vergent in their interspecies toxicity, but also demon- 
strate different target organ selectivities and mecha- 
nisms of action (3,4). Elucidation of the molecular 
mechanisms underlying non-genotoxic carcinogenesis 
is currently underway, but the picture is still far from 
complete. It is anticipated that a better understanding 
of the early changes in genetic expression following 
exposure to non-genotoxic carcinogens will aid devel- 
opment of experimental strategies to identify cellular 
markers which are diagnostic for this type of toxicity. 

Subtractive ddRT-PCR is a recently developed 
technique which facilitates the preferential amplifica- 
tion of gene products that demonstrate altered expres- 
sion in target tissue(s) following exposure to chemical, 
stimuli. Furthermore, using this technique, no prior 
knowledge of the specific genes which are up/down 
regulated is required. In the current study, we have un- 
dertaken to develop a specific and rapid assay for non- 
genotoxic carcinogens using the technique of ddRT- 
PCR. This has allowed us to identify characteristic 
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patterns of gene regulation following administration of 
two different non-genotoxic carcinogens (phenobarbi- 
tal and Wy- 14,643) and the subsequent identification 
of individual gene species which are regulated by this 
xcnobiotic treatment 



MATERIALS AND METHODS 
Animals and treatment 

Phenobarbital (BDH, Poole, UK; 100 mg/kg/day) or 
[4-chloro-6-(2,3-xylidino)-2-pyrimidinylthio] acetic 
acid (Wy- 14,643) (Campo, Emmerich; 250 
mg/kg/day) was administered by gavage to groups of 
3 male Wistar rats (150-200 g) on three consecutive 
days, whilst control animals received nothing. All ani- 
mals had free access to food (rat and mouse standard 
diet, B&K Universal, Hull, UK) and water. The ani- 
mals were killed on the fourth day, whereupon their 
livers were excised, sliced into 0.5 cm cubes, snap fro- 
zen in liquid nitrogen and then stored at -70°C. 

mRNA extraction 

Up to 0.25 g of each frozen liver sample was ground 
under liquid nitrogen using a mortar and pestle. 
mRNA was extracted from the ground liver using 
Promega's PolyATtract® System 1000 (Promega, 
Madison, WI, USA) according to the technical man- 
ual. The mRNA was DNase-treated (Promega, final 
concentration 10 U/ml) before phenol/chloroform ex- 
traction and ethanol precipitation. The mRNA was re- 
suspended at a final concentration 500-1000 ng/^il. 

ddRT-PCR 

This was carried out using the PCR-Select™ cDNA 
Subtraction Kit (Clontech, Palo Alto, CA, USA) ac- 
cording to the manufacturer's instructions. Final PCR 
reactions were run on a 2% Metaphor agarose (FMC, 
Rockland, MD, USA) gel containing ethidium bro- 
mide (Sigma, Dorset, UK) and then overstained for 30 
min with SYBR Green I DNA stain (FMC, 1:10 000 
dilution in TAE). 



Band extraction and cloning 

Each discernible band from the differential display 
pattern was extracted from the gel with a scalpel and 



the DNA eluted using a Genelute™ Agarose Spin Col- 
umn (Supelco, Bellefonte). An aliquot of the eluted 
DNA (5 Hi) was re-amplified using the original ddRT- 
PCR nested primers and electrophoresed on a 2% 
agarose gel. The re-amplified band was extracted from 
the gel (as above) and the eluted DNA ligated directly 
into the TOPO TA Cloning® vector (Invitrogen, 
Carlsbad) before transformation in Escherichia coli 
TOP10F One Shot™ cells (Invitrogen). 



Stage 1 screening 

Twelve transformed (white) colonies from each band 
were grown up for 6 h in 200 \i\ LB broth containing 
ampicillin (Sigma, 50 ng/ml) and 1 ^1 of this ampli- 
fied by PCR reaction (as specified in ddRT-PCR tech- 
nical manual). One quarter of the completed reaction 
was electrophoresed on a standard 2% agarose gel and 
one quarter on a 2% agarose gel containing HA Yel- 
low (Hanse Analytik GmbH, Bremen, Germany, 1 
U/nl) to discern the different cloning products. The re- 
mainder was used to prepare duplicate dotblots on Hy- 
bond N+ (nylon) membranes (Amersham, Little Chal- 
font, UK). Cultures containing different cloning prod- 
ucts were grown up and a plasmid miniprep prepared 
from each (Wizard Plus SV Minipreps DNA Purifica- 
tion System, Promega) according to the manufac- 
turer's instructions. 



Stage II screening 

The duplicate dotblots were probed with: (a) the final 
differentia] display reaction; and (b) the 'reverse-sub- 
tracted* differentia] display reaction. To make the •re- 
verse-subtracted ' probe, the subtractive hybridisation 
step of the ddRT-PCR procedure was carried out using 
the original tester cDNA as a driver and the driver as 
a tester. Probing and visualisation were carried out us- 
ing the ECL Direct Nucleic Acid Labelling and Detec- 
tion System (Amersham) according to the manufac- 
turer's instructions. Those clones which were positive 
for (a) but negative for (b), or showed a substantially 
larger positive signal with (a) compared to (b), were 
chosen for further analysis. 



DNA sequencing 

Positive clones as identified above were sequenced on 
an automated ABI DNA sequencer (Applied Biosys- 
tems, Warrington, UK). 
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Fig. 1 : W) Subtractivc ddRT-PCR patterns obtained from rat liver following 3-day treatment with WY-14,643 or phenobarbital. Lane 
1, 1 kb ladder, lane 2, genes up-regulated following Wy, 14-643 treatment; lane 3, genes down-regulated following 
Wy, 14-643 treatment; lane 4, genes up-regulated following phenobarbital treatment; lane 5, genes down-regulated following 
phenobarbital treatment; and lane 6, lkb ladder. (B) Subtract ve ddRT-PCR patterns obtained from rat liver showing relative 
changes when phenobarbital treated mRNA is subtracted from Wy- 14, 643 -treated mRNA and vice- versa. Lane 1, 1 kb 
ladder, lane 2, genes showing increased expression following Wy- 14,643 treatment compared to phenobarbital treatment; 
lane 3, genes showing increased expression following phenobarbital treatment compared to Wy- 14,643 treatment. See 
Materials and Methods for further details. 



Fig. 2 : Re-amplified ddRT-PCR products which were down-regulated following phenobarbital treatment (upregulated bands were also 
re-amplified but gel not shown). Individual DNA bands excised from gel of ddRTR-PCR reactions were extracted, 
re-amplified and run on agarose gels to confirm amplification of correct band (numbered). See Materials and Methods f r 
further details. 



332 E « r - Drug Metab. Pharmacokinet., 1997, No. 4 
Table 1: Rat liver genes down-regulated by phen barbital treatment 



Band number (Fig. 2) 
(Approximate size in bp) 



Pheno barbital down-regulated 

H ighest sequence homology 



FASTA-EMBL gene identification 



1 (1500) 




95.3% 


2 (1200) 




92.3% 


3(1000) 




91.7% 


7(700) 


Clone 1 


772% 




Clone 2 


94.5% 




Done 3 


91.0% 


8(650) 


Clone 1 


86.9% 




Clone 2 


96.2% 


9(600) 


Clone 1 


86.9% 




Clone 2 


82.0% 


10(550) 




73.8% 


11 (525) 




95.7% 


12 (375) 




100.0% 


13 (230) 


Clone 1 


97.2% 




Clone 2 


100.0% 




Clone 3 


100.0% 


14 (170) 




96.0% 


15 (140) 




97.3% 


rs: (300) 




96.7% 


(275) 




93.1% 



Rat mRNA for 3-oxoacyl-CoA thioiase 

Rat hemopoxin mRNA 

ft rattus alpha-2u-globulin mRNA 

M. musculus mRNA for CI inhibitor 

Rat electron transfer flavoprotein 

Mouse topoisomerase 1 (Topo 1) mRNA 

Soares 2NbMT M. musculus (EST) 

Rat alpha-2u-globulin (s-type) mRNA 

Soares mouse NML M. musculus (EST) 

Soares p3NMF19.5 M.musculus (EST) 

Soares mouse NML M. musculus (EST) 

NCI_CGAP_Pr1 H. sapiens (EST) 

ft norvegicus mRNA for ribosomal protein 

Soares mouse embryo NbME135 (EST) 

Rat fibrinogen B-beta-chain 

Rat apoiipoprotein E gene 

Soares p3NMF19.5 M. muscuius (EST) 

Stratagene mouse testis (EST) 

ft norvegicus RASP 1 mRNA 

Soares mouse mammary gland (EST) 



EST = expressed sequence tag. 
Bands 4-6 were shown to be false positives by dotblot analysis and, therefore, not sequenced. 



Table II : Rat liver genes up-regulated by phenobarbital treatment 



Band number 
(Approximate size in bp) 



Phenobarbital up-regulated 

H igh est sequen ce homology 



FA ST A -EM BL gen e iden tification 



5(1300) 




93.5% 


Rat cytochrome P450IIB1 


7(1000) 




95.1% 


mRNA for rat preproalbumin 
Rat serum albumin mRNA 


8(950) 




98.3% 


NCLCGAP_Pr1 H. sapiens (EST) 


10(650) 




95.7% 


Rat cytochrome P450IIB1 


11 (800) 


Clone 1 


94.9% 


Rat cytochrome P450IIB1 


12 (750) 


Clone 2 


75.3% 


Rat cytochrome p450-L (p450IIB2) 




93.8% 


Rat TRPM-2 mRNA 


15(600) 






Rat mRNA for sulfated glycoprotein 




92.9% 


mRNA for rat preproalbumin 
Rat serum albumin mRNA 


16(550) 


Clone 1 


95.2% 


Rat cytochrome P450IIB1 


21 (350) 


Clone 2 


93.6% 


Rat haptoglobulin mRNA partial alpha 




99.3% 


ft norvegicus g nes for 18S, 5.8S & 28S rRNA 



EST = expressed sequence tag. 
Bands 1-4. 6, 9. 13. 14 and 17-20 shown to be false positives by dotblot analysis and. therefore, not sequenced. 



7. C Rockett ex aL, Hepatocarcinogenesis and ddRT-PCR 



333 



Identification of differentially-regulated 
genes 

' Gene-sequences were identified using the FASTA pro- 
gramme (http^/www .ebi.ac.uk/htbin/fasta.py ?request) 
to search all EMBL databases for matching DNA se- 
quences. 

RESULTS 

Figure 1A,B shows the ddRT-PCR patterns of genes 
showing altered expression in rat liver following 3 day 
treatment with phenobarbital or Wy- 14,643. Individual 
bands were isolated from the phenobarbitai-modulated 
patterns (both up- and down-regulated), re-amplified 
(Fig. 2), cloned, screened for false positives and then 
identified. Those xenobiotic-modulated gene products 
identified to date are listed in Tables I and II. 



DISCUSSION 

The advent of combinatorial chemistry has led to the 
synthesis of millions of new chemical compounds, 
many of which may be potentially useful in pharma- 
ceutical, agricultural or industrial applications. How- 
ever, whilst there are tests available for those posing a 
genotoxic activity, there remains no short-term assay 
able to identify those chemicals which may belong to 
the non-genotoxic group of carcinogens. 

We have used an adaptation of the subtractive hy- 
bridisation method - ddRT-PCR - to produce charac- 
teristic profiles or 'fingerprints' of those genes which 
are up-regulated or down-regulated in male rat liver 
following acute exposure to test chemicals. The ddRT- 
PCR profiles are characteristic and unique for each of 
the 2 compounds studied to date. 

A number of those gene species showing altered 
expression following phenobarbital treatment have 
been cloned and identified (Tables I & II). It is inter- 
esting to note the presence of CYP2B2 in the up-regu- 
lated genes. This would, of course, be expected fol- 
lowing exposure to phenobarbital and serves as a posi- 
tive control for the method. Other genes which one 
might normally expect to be up-regulated do not ap- 
pear in the table. However, it should be noted that not 



all bands seen on the differential display were ex* 
tracted and re-amplified due to their being too faint or 
too close to other bands to accurately excise. Further- 
more, it has been well documented [(5) and references 
therein] that a single band extracted from a differential 
display often represents a composite of heterogeneous 
products. We are currently examining new methods to: 
(i) improve resolution of the differential display pat- 
terns (including 2-D agarose gels); and (ii) distinguish 
those ddRT-PCR products which are identical in size, 
but different in sequence. 

Our future efforts will be directed towards deter- 
mining the extent of modulation of a number of the 
genes reported herein using semi-quantitative RT- 
PCR. This should reveal the extent of changes in ex- 
pression of key gene products which may be involved 
in non-genotoxic hepatocarcinogenesis and thus help 
increase understanding of this process. Furthermore, it 
is anticipated that aligning ddRT-PCR profiles of dif- 
ferent non-genotoxic agents found in responsive and 
non-responsive species may enable identification of 
those genes which are mechanistically relevant to the 
non-genotoxic hepatocarcinogenic process. Accord- 
ingly, this approach lends itself well to the identifica- 
tion, characterisation and sub-classification of possible 
different classes of non-genotoxic carcinogens. 
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Abstract 

Understanding the genetic profile of a cell at all stages of normal and carcinogenic development should provide an 
essential aid to developing new strategies for the prevention, early detection, diagnosis and treatment of cancers. We 
have attempted to identify some of the genes that may be involved in peroxisome-proliferator (PP)-induced 
non-genotoxic hepatocarcinogenesis using suppression PCR subtractive hybridisation (SSH). Wistar rats (male) were 
chosen as a representative susceptible species and Duncan-Hartley guinea pigs (male) as a resistant species to the 
hepatocarcinogenic effects of the PP, [4-chloro-6-(2,3-xylidino)-2-pyrimidinylthio] acetic acid (Wy-1 4,643). In each 
case, groups of four test animals were administered a single dose of Wy-1 4,643 (250 mg/kg per day in corn oil) by 
gastric intubation for 3 consecutive days. The control animals received corn oil only. On the fourth day the animals 
were killed and liver mRNA extracted. SSH was carried out using mRNA extracted from the rat and guinea pig 
livers, and used to isolate genes that were up and downregulated following Wy-1 4,643 treatment. These genes 
included some predictable (and hence positive control) species such as CYP4A1 and CYP2C11 (upregulated and 
downregulated in rat liver, respectively). Several genes that may be implicated in hepatocarcinogenesis have also been 
identified, as have some unidentified species. This work thus provides a starting point for developing a molecular 
profile of the early effects of a non-genotoxic carcinogen in sensitive and resistant species that could ultimately lead 
to a short-term assay for this type of toxicity. © 2000 Elsevier Science Ireland Ltd. All rights reserved. . 
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RT-PCR; Rat; Guinea pig; Gene regulation; Differential gene display; Gene profiling 
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Intr duction 

The advent of combinatorial chemistry and 
>mputer-aided drug design has led to a recent 
psurge in the number of chemical compounds 
tat have potential therapeutic, agricultural and 
idustrial applications. Although it has been sug- 
jsted that the contribution of synthetic chemicals 
> the overall incidence of human cancer is low, 
lere still remains an absolute requirement to 
/aluate all new chemicals for toxic and carcino- 
jnic potential. The latter is one of the most 
roblematic areas of chemical safety evaluation 
id is usually carried out using short-term in vitro 
id in vivo genotoxicity assays augmented by 
ironic bioassay tests. The short-term assays have 
roved useful in the early identification of poten- 
al genotoxic carcinogens, but their value is lim- 
sd by observations that suggest that 
^proximately 60% of chemicals identified as car- 
nogens in life-exposure studies produce mainly 
;gative findings in short-term genotoxcity tests 
^shby, 1992; Parodi, 1992). Thus, there is cur- 
ntly no reliable and rapid means of evaluating 
e carcinogenic risk of new chemicals that fall 
to this latter group of compounds, termed non- 
notoxic (or epigenetic) carcinogens. 
One approach to addressing this problem is to 
ucidate the molecular mechanisms by which 
lown non-genotoxic carcinogens act. It should 
en be possible to identify common factors/ 
echanisms that can serve as early biomarkers of 
rcinogenic potential for new chemicals. To this 
d, a large number of groups have reported on 
e various effects of non-genotoxic compounds 

various animal species (Marsman et al., 1988; 
ike et al., 1993; Cattley et al., 1994; Hayashi et 
, 1994; Human and Experimental Toxicology, 
94; Anderson et al., 1996). However, the mech- 
istic picture is still far from complete with many 
those genes involved in the carcinogenic pro- 
;s remaining unknown, and their identification 
before remains a key goal in elucidating the 
Secular mechanisms by which non-genotoxic 
cinogenesis occurs. 

Subtractive hybridisation (SH) and related tech- 
logies such as representational difference analy- 
(RDA) (Hubank and Schatz, 1994) and 



differential display (DD) (Liang and Pardee, 
1992) can be used to aid the isolation of genes 
showing altered expression in target tissues fol- 
lowing exposure to a chemical stimulus. These 
techniques can also be used to identify differential 
gene expression in neoplastic and normal cells 
(Liang et al., 1992), infected and normal cells 
(Duguid and Dinauer, 1990), differentiated and 
undifferentiated cells (Sargent and Dawid, 1983; 
Guimaraes et al., 1995), activated and dormant 
cells (Gurskaya et al., 1996; Wan et al., 1996), 
different cell types (Hedrick et al., 1984; Davis et 
al., 1984) amongst others. Most importantly, us- 
ing such approaches, no prior knowledge of the 
specific genes that are upregulated/downregulated 
is required. 

Using a variation of SH, termed suppression- 
PCR subtractive hybridisation (SSH) (Diatchenko 
et al., 1996), we have previously reported the 
isolation of a number of genes showing altered 
expression in male rat liver following acute expo- 
sure to phenobarbital (Rockett et al., 1997). In 
the current work we have used the same experi- 
mental approach to isolate genes that are differen- 
tially expressed in the livers of male rats and 
guinea pigs following short-term (3-day) exposure 
to the peroxisome proliferator (PP) and non- 
genotoxic hepatocarcinogen, Wy-14,643. We have 
isolated and identified a number of gene species, 
some of which may be important in the induction 
of, or protection against, non-genotoxic 
hepatocarcinogenesis. 



2. Materials and methods 

2.7. Animals and treatment 

All animal experiments were undertaken in ac- 
cordance with Her Majesty's Home Office De- 
partment guidelines under the auspices of 
approved personal and project licences. Male 
Wistar rats (150-200 g) and male Duncan-Hart- 
ley guinea pigs (250-300 g) were obtained from 
Kingman and Bantam (Hull, UK). Upon receipt, 
both groups were randomly assigned into two 
groups of four. They were maintained on a rat, 
mouse or guinea pig standard diet (B&K Univer- 
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sal, Hull) and a daily cycle of alternating 12-h 
. periods of dark and light. The room temperature 
was maintained at 19°C and a relative humidity of 
55%. The animals were acclimatised to this envi- 
. ronment for 7 days before treatment commenced. 
[4-chloro-6-(2 5 3-xy]idino)-2-pyrimidinylthio] acetic 
acid (Wy- 14,643, Campo, Emmerich; 250 mg/kg 
per day in corn oil) was administered by gavage 
to the treated groups of rats and guinea pigs on 3 
consecutive days, whilst control groups received 
an equal volume of corn oil only. During this 
time, all animals had free access to food and 
water. The animals were killed by cervical disloca- 
tion on the fourth day, and their livers immedi- 
ately excised, weighed, sliced into approximately 
0.5-cm cubes, snap frozen in liquid nitrogen and 
stored at - 70°C. 

2.2. mRNA extraction 

Approximately 0.25 g of each frozen liver sam- 
ple was ground under liquid nitrogen using a 
mortar and pestle. Messenger RNA was extracted 
from the ground liver using the PolyATtract® 
System 1000 kit (Promega, Madison, USA) ac- 
cording to the technical manual provided by the 
manufacturers. The mRNA was DNase-treated 
(RQ Rnase-free Dnase, Promega, final concentra- 
tion 10 U/ml) before phenol/chloroform extrac- 
tion and ethanol precipitation. The mRNA was 
redissolved at a final concentration 500-1000 ng/ 
Hi. 

2.3. cDNA Subtraction 

This was carried out using the PCR-Select™ 
cDNA Subtraction Kit (Clontech, Palo Alto, 
USA) according to the manufacturer's instruc- 
tions. Subtractions were carried out with mRNAs 
derived from single animals. The mRNA from the 
remaining three animals in each group was later 
used for quantitative RT-PCR analysis of specific 
genes. 

2.4. Band extraction and cloning 

The secondary PCR reactions from the cDNA 
subtraction procedure were run on a 2% 
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Metaphor agarose gel (FMC, Rockland, USA) 
containing 0.5 ng/ml ethidium bromide (Sigma, 
Dorset, UK). One times TAE (0.04 M Tris-ac- 
etate, 0.001 M EDTA) was used to prepare the gel 
and as the running buffer. After running for 6-7 
h at 3.75 V/cm, the gel was overstated for 30 min 
with SYBR Green I DNA stain (FMC, 1:10000 
dilution in 1 x TAE). Each discernible band of 
the differential display pattern was extracted from 
the gel with a scalpel and the DNA eluted using a 
Genelute™ agarose spin column (Supelco, Belle- 
fonte, USA). Five microlitres of the eluted DNA 
was reamplified using the original nested (sec- 
ondary) PCR primers supplied with the PCR-Se- 
lect™ cDNA subtraction kit. The PCR products 
were electrophoresed on a 2% standard agarose 
gel (Boehringer Mannheim, East Sussex, UK) and 
the reamplified target bands extracted from the 
gel as above. The eluted DNA was immediately 
ligated into a TOPO TA Cloning® vector (Invitro- 
gen, Carlsbad, USA) before transformation in 
Escherichia coli TOPI OF' One Shot™ cells 
(Invitrogen). 

2.5. Colony screening 

2.5.1. Stage I 

Eight transformed (white) colonies from each 
band were grown up for 6 h in 200 nl LB broth 
containing ampicillin (Sigma, 50 mg/ml). One mi- 
crolitre of this was subjected to PCR using the 
same conditions and nested primers as described 
above. One tenth (2 yl) of the completed PCR 
reaction was electrophoresed on a 2% standard 
agarose gel and one tenth on a 2% standard 
agarose gel containing HA red (Hanse Analytik 
GmbH, Bremen, Germany, 1 U/ml) to discern the 
differentially cloned products. The remainder of 
the PCR reaction was used to prepare duplicate 
dotblots on Hybond N + membranes (Amersham, 
Little Chalfont, UK). 

2.5.2. Stage II 

The duplicate dotblots were probed with (a) the 
final differential display reaction and (b) the 're- 
verse-subtracted' differential display reaction. To 
make the 'reverse-subtracted' probe, the subtrac- 
tive hybridisation step of the differential display 
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RT-PCR procedure was carried out using the 
original tester (treated) mRNA as the driver and 
the original driver (control) mRNA as the tester. 
Probing and visualisation were carried out using 
the ECL direct nucleic acid labelling and detec- 
tion system (Amersham, Little Chalfont, UK) ac- 
cording to the manufacturer's instructions. Those 
clones that were positive for (a) but negative for 
b), or showed a substantially larger positive sig- 
nal with (a) compared to (b), were selected for 
DNA sequence analysis. 

2.6. DNA sequencing 

The remainder of the cultures (prepared in 
>tage I screening) containing different cloning 
oroducts (as discerned in the two screening steps) 
vere grown up overnight in 5 ml LB broth con- 
aining ampicillin (50 mg/ml). A plasmid miniprep 
vas prepared from each (Wizard Plus SV 
Vlinipreps DNA purification system, Promega) 
iccording to the manufacturer's instructions. The 
;loned inserts were sequenced on an automated 
\B\ DNA sequencer (Applied Biosystems, War- 
ington, UK) using the M13 forward primer 
GTAAAACGACGGCCAGT) or M13 reverse 
)rimer (AACAGCTATGACCATG). 

1 7. Identification of differentially regulated genes 

Gene sequences thus obtained were identified 
ising the FASTA 3.0 programme (Lipman and 
'earson, 1985; Pearson and Lipman, 1988) (http:/ 
www.ddbj.nig.ac.jp/E-mail/homology.html) to 
earch all EMBL databases for matching DNA 
equences. Each clone sequence was submitted in 
he forward and reverse direction, and the one 
burning the highest statistical probability of 
latch to a known sequence was noted. Sequence 
omologies between our submitted clone sequence 
nd the queried database sequence were deter- 
lined (by FASTA) over a region of at least 60 
ase pairs. 

.8. RT-PCR analysis of selected candidate genes 

cDNA sequences of the target genes were ob- 
tined from the NIH gene database (GenBank at 



http://www.ncbi.nlm.nih.gov/Web/Search/index. 
html) and the computer programme gene 
jockey (BioSoft, Cambridge, UK) used to select 
primer pairs from these sequences. Where guinea 
pig sequences were available, rat and guinea pig 
sequences were aligned and primers chosen from 
regions of homology. If guinea pig sequences were 
not available, rat and human sequences were 
used. In cases where exact homology could not be 
found, the sequence from the rat was used. In the 
case of CD81 only, no rat or guinea pig sequences 
were available and so mouse and human se- 
quences were aligned and a primer pair chosen 
from a region of homology. Primers (obtained 
from Gibco-BRL, Paisley, UK) were dissolved at 
a concentration of 50 pmol/nl in sterile distilled 
water and stored at -20°C. The primer pairs 
used plus other reaction parameters are shown in 
Table 1. mRNA was extracted (as described 
above) from all four treated animals and from 
three animals in the control group. Integrity of 
the eluted mRNA was confirmed on a 2% agarose 
gel, and the concentration and purity were mea- 
sured using a Genequant II spectrophotometer 
(LKB, Bromma, Sweden) and then diluted to 10 
ng/jil. One microlitre of this latter solution was 
used per RT-PCR reaction. 

RT-PCR was carried out in a single tube (50 
reaction using the Access RT-PCR system 
(Promega) according to manufacturer's instruc- 
tions. In the kinetic and quantitative analyses, 
omission of RNA was used as a control for the 
presence of any contaminating DNA. After ob- 
taining a PCR signal of the correct size and 
optimising the reaction conditions, each PCR 
product was digested with between two and four 
separate restriction enzymes. Specific restriction 
patterns were thus obtained, which further confi- 
rmed the identity of the PCR products as being 
the original target genes. Kinetic analysis (14-32 
cycles) was then performed in each case to deter- 
mine the location of the mid-log phase. 

For the semi-quantitative analysis of each 
target gene, RT-PCR reactions were carried out in 
triplicate for each sample to reduce the effect of 
intertube RT-reaction variations (Kolls et aL, 
1993) and pipetting errors. For each gene, a mas- 
termix containing enough reagents for three times 
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the number of samples (seven for rat, six for 
guinea pig) was prepared except that mRNA was 
omitted, the latter being added after aliquoting 49 
pi of the mastermix into an appropriate number 
of tubes. Amplification of albumin (the reference 
gene) was carried out in separate tubes since the 
mid-log phase of this gene is at a much lower 
cycle number than the target genes due to its high 
abundance. All RT-PCR products were analysed 
on 2% agarose gels containing 0.5 |ig/ml ethidium 
bromide. The target gene samples were loaded on 
the gel first and run in at 3 V/cm for 10 min. The 
corresponding albumin samples were then loaded 
and the gel run for a further 1/2 h. In this way, all 



RT-PCR products from each target gene and 
albumin from the corresponding samples could be 
run on the same gel. Gels were photographed 
using type 665 posi-neg film (Sigma) and quanti- 
tation of the band intensity was carried out using 
a dual wavelength flying spot laser scanner densit- 
ometer (Shimadzu). 

2.9. Statistical analysis 

Statistical analysis of unpaired samples was car- 
ried out using the two-tailed Student's /-test. Val- 
ues were considered statistically significant at 
P < 0.05 or less. 
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Fig. 1. Final displays of differentially expressed genes that 
were (1) upregulated and (2) downregulated in rat (A) and 
guinea pig (B) livers following 3-day treatment with Wy- 
14,643. mRNA extracted from control and treated livers was 
used to generate the differential displays using the PCR-Select 
cDNA subtraction kit (Clontech). Lane (L) is a 1 Kb DNA 
Ladder standard and 10 u! of secondary PCR reaction were 
loaded in all other lanes. 



3. Results 

3. 1. Cloning and screening of transcripts 

For both the rat and guinea pig experimental 
groups, cDNA subtraction was carried out in the 
forward (control driving tester) and reverse (tester 
driving control) directions to isolate both upregu- 
lated and downregulated mRNA species respec- 
tively. Using a standard primary hybridisation 
time of 8 h we obtained a substantial amount of 
non-specific products in all the final differential 
displays (data not shown). This background 
smearing was almost completely removed by re- 
ducing the primary hybridisation time to 4 h 
(CLONTECHniques, 1996). Fig. 1 shows the 
ddRT-PCR patterns of genes showing altered ex- 
pression in rat and guinea pig liver following 
3-day treatment with Wy-14,643. The profiles are 
unique for each species, and in each case the 
profile for the upregulated genes (control mRNA 
driving tester mRNA) is different to that obtained 
for the downregulated genes (tester mRNA driv- 
ing control mRNA). 

The practical outcome of the SSH method is 
that a series of differentially expressed genes is 
observed as a ladder on an agarose gel. The 
majority of these gene fragments fall within the 
150-2000 bp range, with bands up to 5 Kbp 
occasionally being observed. Each band may the- 
oretically consist of one or more products of 
similar size, as the gel has a maximum resolution 
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Fig. 2. Discrimination of different ddRT-PCR products having 
the same molecular size using HA-red. Gel (A) is a 2% 
standard agarose gel. Gel (B) is a 2% standard agarose gel 
containing 1 U/ml HA-red. Band numbers refer to the sequen- 
tial bands (largest to smallest) extracted from the original 
display of genes upregulated in rat liver following 3-day treat- 
ment with Wy-14,643. Ten micorlitres of each PCR reaction 
were loaded per lane. 



of approximately 1.5% (3 bp per 200). In addi- 
tion, there may be two or more products that are 
the same size, but have a different sequence. 
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Therefore some form of discrimination must be 
employed to isolate as many of these products as 
possible. HA-red screening (Geisinger et al M 1997) 
of a number of clones derived from each band 
provided a means to discriminate between differ- 
ent gene species of the same size. A typical exam- 
ple of such a gel is shown in Fig. 2. In total, 88 
and 48 apparently different clones were obtained 
from the final differential expression patterns of 
upregulated and downregulated rat genes, respec- 
tively. Sixty nine and 89 apparently different 
clones were obtained from the final differential 
expression patterns of the upregulated and down- 
regulated guinea pig genes, respectively. 

Having identified as many different candidate 
gene products as possible in the screening step I, a 
second screening step was carried out on every 
clone to confirm those that represented true dif- 
ferentially expressed genes. This is necessary since 
no subtraction technique is 100% efficient. The 
approach we used, termed PCR-select differential 
screening (as recommended in Clontech's PCR-se- 
lect cDNA subtraction kit protocol), utilises the 
forward and reverse subtractions as an aid to 
screening for the true differentially expressed 
genes (CLONTECHniques, 1997). Because these 
probes have already undergone subtraction, they 
have been enriched for differentially expressed 
genes and are therefore more sensitive than un- 
subtracted driver/tester cDNA probes for detect- 
ing true differential expression. All the clones that 
were isolated from each display were dotblotted 
and probed with the display from which they was 
obtained, plus the corresponding reverse-sub- 
tracted display. An example of such a blot is 
shown in Fig. 3. Clones corresponding to authen- 
tic differentially expressed mRNAs hybridised 
with the subtracted cDNA probe, but not the 
reverse-subtracted probe. We also included in the 
authentic positives, those clones that gave a sub- 
stantially greater signal with the subtracted probe 
compared to the reverse-subtracted probe. False 
positives hybridised with either both probes or 
with neither probe. Of the original 88 upregulated 
and 48 downregulated rat clones selected for this 
screening step, 28 (32%) and 15 (31%) respec- 
tively, were found to be true positives. In the rat, 
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8 (100%) of the true positive upregulated genes 
Table 2) and 11 (73%) of the true positive down- 
egulated genes (Table 3) were non-redundant. Of 
he original 69 upregulated and 89 downregulated 
uinea pig clones selected for this screening step, 
8 (70%) and 37 (42%) respectively, were found to 
»e true positives. Thirty six (75%) of the upregu- 
ited genes (Table 4) and 33 (89%) of the down- 
egulated genes (Table 5) were non-redundant. 

.2. Identification of clones 

On sequence analysis it was found that some 
lones were unsequencable in the first instance 
V113 forward primer) due to long polyA runs 
lat appeared to prematurely terminate the se- 
uencing reaction. These clones were therefore 
-'sequenced from the opposite direction using the 
113 reverse primer. Those xenobiotic-modulated 
me products identified to date are listed in Ta- 
es 2 and 3 (rat) and Tables 4 and 5 (guinea pig). 




3. Dot blots of clones of putative upregulated gene species 
ated from guinea pig liver following 3-day treatment with 
-14,643. All clones identified in the stage I screening step 
methods) were blotted and probed with (A) the differen- 
display from which they originated (control driving 
ted) and (B) the reverse subtraction (treated driving con- 
). Arrows indicate some of the true differentially expressed 
tes. 



Table 2 

Identification of genes that were upregulated in male rat liver 
following 3-day treatment with WY-14,643- 



FASTA-EMBL gene 
identification (rat un- 
less otherwise stated) 



Accession No. 



Sequence 
homology" (%) 



Carnitine octanoyl 

transferase 
NCI_CGAP_Lil (H. 

sapiens) (EST 1 *) 
Peroxisomal enoyl 
hydratase-like 
protein 
Liver fatty acid bind- 
ing protein 
Soares mouse 
p3NMF19.5 Af. 
musculus cDNA 
clone 

Cytochrome 
p450IVAl 

Mit. 3-hydroxyl-3- 
methylglutaryl 
CoA synthase 

Rabgeranylgeranyl 
transferase compo- 
nent B 

Genes for 18S, 5.8S, 
and 28S ribosomal 
RNAs 

Carnitine acetyl 
transferase (mouse) 

Soares mouse NML 
(EST) 

Bone marrow stromal 
fibroblast (H. sapi- 
ens) cDNA clone 
HBMSF2E4 (EST) 

7.5dpc embryo 
(mouse) (EST) 

Alpha- 1 -macroglobu- 
lin 

Transferrin 
Lecithinxholesterol 

acyltransferase 
Zn-oc2-glycoprotein 
Serum albumin 
Fructose- 1,6-bisphos- 

phate 1-phospho- 

hydrolase 
Soares mouse 

melanoma (EST) 

(S c ) 
Soares mouse 

3NbMS (EST) 

(AS C ) 



RN26033 
HS 1275949 
RN08976 

V01 235 
AA038051 



RNRRNA 



AA408192 

RNALPH1M 

RNTRANSA 
RNU62803 

RNZA2GA 
RNJALBM 
RNFBP 

AA 124706 

AA 154039 



99 
98 
98 

96 
96 



RNCYPLA 94 
RNHMGCOA 94 

RNRABGERA 94 



94 



MMRNACAR 92 
MM1157113 92 
AA545726 92 



92 

91 

91 
90 

90 
89 
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Table 2 (Continued) 



FASTA-EMBL gene Accession No. Sequence 
identification (rat un- homology 8 (%) 

Jess otherwise stated) 



1 7-f^hydroxsteroid de- RN 1 7BHDT2 87 

hydrogenase 
Soares mouse AAO38051 87 

p3NMF19.5 (EST) 
Peroxisomal enoyl- RNPECOA 85 

CoArhydratase -3- 

hydroxyacyl CoA 

Afunctional enzyme 
Integral membrane S45012 81 

protein, TAPA-1 

(CD81) (mouse) 
Scares mouse lymph MMAA88445 81 

node (EST) 
H. sapiens (clone L40401 76 

zap!28) mRNA 
Lysophospholipase ho- HSU67963 76 

mologue (human) 
Soares mouse lymph AA2 17044 74 

node (EST) 



a Refers to the nucleotide sequence homology between the 
cloned band isolated from the differential display and the 
corresponding gene derived from the EMBL gene sequence 
bank. 

b EST is 'expressed sequence tag' — a gene of as yet 
unknown identity and function. 

c Where sequence homologies were equal in both directions 
of the isolated band, both the sense (S) and antisense (A) 
identities are given. 



In all cases, both the forward and reverse se- 
quence of the target clones were analysed and the 
gene having the highest statistical homology 
noted. 

3.3. RT-PCR analysis of selected clones 

The results of a typical RT-PCR semi-quantita- 
tion experiment for transferrin in the rat is given 
in Fig. 4 and the results for a total of 12 selected 
genes in both the rat and guinea pig are shown in 
Table 6. 



21 
Table 3 

Identification of genes that were downregulated in male rat 
liver following 3-day treatment with Wy- 14,643 

FAST-EMBL gene Accession No. Sequence 
identification (rat un- homology* (%) 

less otherwise stated) 



NCI CCA? Lil (H 


AA484528 


OG 

77 


saniens) fEST b WS c t 






NCT CHAP Prl (hf 




00 

77 


SUpiCfli ) \Co I )\f\J } 






T Til W ftl 1 1 l API fl 

u Ur -giucuronosy j- 




00 
70 


transf erase 












Complement compo- 


KIN K^j 


96 


nent c3 






Soares mouse pla- 


AA023305 


96 


centa (b) 






Ape (chimpanzee) 28S 


r J RuML 


96 


rRNA (AS) 






Rat CYP2C11 


RNCYPM1 


95 


Ribosomal protein S5 


RNRPS5 


94 


Transthyretin 


RNTTHY 


94 


Contrapsin-like 


RNCCP23 


89 


protease inhibitor 






Prostaglandin F2a (S) 


RN26663 


84 


(3-2-microglobulin 


RNB2MR 


84 


(AS) 






Apolipoprotein C-III 


RNAPOA02 


82 


Parathymosin-alpha 


RN11ZNBP 


75 



(zinc2 + -binding 
protein) 



a Refers to the nucleotide sequence homology between the 
cloned band isolated from the differential display and the 
corresponding gene derived from the EMBL gene sequence 
bank. 

b EST is 'expressed sequence tag* — a gene of as yet 
unknown identity and function. 

c Where sequence homologies were equal in both directions, 
both the sense (S) and antisense (A) identities are given. 



4. Discussion 

It is now apparent that all cancers arise from 
accumulated genetic changes within the cell. Al- 
though documenting and explaining these changes 
presents a formidable obstacle to understanding 
the different mechanisms of carcinogenesis, the 
experimental methodology is now available to 
begin attempting this difficult challenge. In order 
to begin the elucidation of the molecular mecha- 
nisms involved in non-genotoxic hepatocarcino- 
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enesis, we have used SSH to identify a number of Lake et al., 1989; Makowska et al., 1992; Lake et 

enes that are upregulated or downregulated in al., 1993). 

aale rat and guinea pig livers following short Gurskaya et al. (1996), who originally devel- 

;rm exposure to the PP, Wy-14,643. We have oped the SSH technique, cloned the products of 

sed the rat model to represent a species suscepti- the secondary PCR reaction and screened a small 

le to the non-genotoxic carcinogenic effect of number of randomly selected colonies for differ- 

Ps and the guinea pig as a resistant species entially expressed clones using northern hybridisa- 

Drton et al., 1984; Rodricks and Turnbull, 1987; tion. However, we decided against this approach 

able 4 

lentification of genes that were upregulated in male guinea pig liver following 3-day treatment with WY-14,643 



ASTA-EMBL gene identification (guinea pig unless otherwise stated) Accession No. Sequence 

homology" (%) 



3rboxylesterase ABO 10634 97 

omplement C3 protein (GPC3) M34054 97 

ytosolic aldehyde dehydrogenase (sheep) U 12761 92 

atalase (human) X04076 89 

iitochondrial aspartate aminotransferase (pig) Ml 1732 89 

ongation factor-1 -alpha (rabbit) X62245 88 

CI_CGAP_Br2 H. sapiens cDNA clone (EST) (Similar to chick mit. phosphoenolpyru- AA587436 87 

vate carboxykinase) 

lpha-1-anuproteinase S M57270 83 

)-formyltetrahydrofolate dehydrogenase (rat) M59861 83 

ibosomaJ protein L6 (rat) X87107 83 

>ares pregnant uterus Nb (EST) (mouse) AA 156847 83 

itochondrial citrate transport protein (human) L77567 80 

/toplasmic chaperonin hTRiC5 (human) U 17104 80 

;pha-l-antiproteinase F M 57271 77 

Merogeneous nuclear ribonuclearprotein cl/c2 (human) D28382 77 

>ares parathyroid tumour (EST) (similar to human serum albumin precursor) AA860651 76 

ratagene mouse kidney (EST) AA107327 75 

)ares parathyroid tumour NbHPA human cDNA (EST) AA860653 74 

>ares mouse mammary gland (EST) AA6 19297 74 

DNA clone 15 004 (EST) (human) HO 1826 74 

lares senescent fibroblasts (EST) (mouse) W52190 74 

eproalbumin (human) E04315 72 

DNA clone 73 169 (EST) (human) T56624 72 

tamin D-binding protein (human) LI 0641 71 

ooH gene (exon 8) (human) Yl 1498 71 

>RL flow sorted chromosome B05457 71 

lares foetal liver spleen (EST) (mouse) AA009524 71 

>ares foetal heart NbMH19W (EST) (mouse) AA009421 69 

>ares foetal heart NbHH19W H. sapiens cDNA clone (EST) W94377 67 

lenylalanine hydroxylase (human) U49897 67 

oIine-5-carboxylate dehydrogenase (human) U24266 66 

utathione-S-transferase homologue (human) U90313 65 

ZI_CGAP_GCBI (EST) (human) AA769294 65 

otective protein (human) M22960 64 

one 27 375 (EST) (human) N37046 62 

-atagene colon ( # 937 204) H. sapiens cDNA clone (EST) AA149777 62 



1 Refers to the nucleotide sequence homology between the cloned band isolated from the differential display and the correspond- 
l gene derived from the EMBL gene sequence bank. 
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Table 5 

. Identification of genes that were downregulated in male guinea 
pig liver following 3-day treatment with WY-14,643 

FASTA-EMBL gene Accession N . Sequence 
-identification (guinea homology 11 (%) 

pig unless otherwise 
stated) 



Complement C3 


M34054 


97 


protein 






Murinoglobulin 


D84339 


95 


Alpha- 1-an- 


M57271 


88 


tiproteinase F 






Elongation factor-al- 


X62245 


89 


pha- 1 (rabbit) 






Coupling protein G 


X04409 


88 


(human) 






NCl_CGAP_Ovl 


AA586309 


87 


(EST b ) (human) 






Lecithinxholesterol 


D13668 


85 


acetyl transferase 






(rabbit) 






Aldolase B (human) 


X00270 


84 


Anti-thrombin III 


E00116 


80 


(human) 






Phenylalanine hy- 


K03020 


80 


droxylase (human) 






Inter-oc-trypsin in- 


D38595 


79 


hibitor (human) 






Normalised rat mus- 


AA849753 


78 


cle (EST) (S c ) 






Normalised rat ovary 


AA803059 


78 


(EST) (AS C ) 






Complement factor 


X00284 


77 


Ba fragment (hu- 






man) 






Dihydrodiol dehydro- 


U05598 


76 


genase (human) 






Spot 14 gene (thyroid- 


Y08409 


75 


inducible hepatic 






protein )(human) 






BAC clone 174pl2 


AC004236 


75 


(human) 






Mitochondria] alde- 


X05409 


74 


hyde dehydroge- 






nase (human) 






Preproalbumin (hu- 


E04315 


74 


man) 






NCI_CGAP_Pr9 


AA533142 


74 


(EST) (human) (S) 






Normalised rat pla- 


AA85I197 


74 


centa (EST) (AS) 






Heparin sulfate pro- 


J04621 


73 


teoglycan (human) 






cDNA clone 33 992 


R24330 


73 


(EST) (human) 







Table 5 {Continued) 



FASTA-EMBL gene Accession No. Sequence 
identification (guinea homology* (%) 

pig unless otherwise 
stated) 



Retinol dehydrogenase 


U33501 


71 


(rat) 






TAPA-1 integral mem- 


S45012 


71 


brane protein 






(CD81) (mouse) 






Complement compo- 


M35525 


70 


nent c5s 






Apolipoprotein B (pig) 


LI 1235 


69 


cDNA clone 143 918 


R76742 


68 


(EST) (human) 






a-fibrinoeen (human) 

" ~ * C ^ ------- 1 


K02569 


68 


Soares foetal liver 


W03726 


68 


spleen INF (mouse) 






Barstead bowel (EST) 


AA232049 


67 


(mouse) 






UDP glucuronosyl 


AF0309137 


66 


transferase (cat) 






Myeloid leukaemia cell 


L08246 


65 


differentiation 






protein (MCL-1) 






(human) (S) 






STS SHGC-34 987 (hu- 


G27984 


65 


man) (AS) 






Soares mouse 


AA222798 


64 


3NME125 






Stratagene mouse em- 


AA 199420 


64 


bryonic (EST) (S) 






Rad 52 (mouse) 


AF004854 


63 



a Refers to the nucleotide sequence homology between the 
cloned band isolated from the differential display and the 
corresponding gene derived from the EMBL gene sequence 
bank. 

b EST is 'expressed sequence tag' — a gene of as yet 
unknown identity and function 

c Where sequence homologies were equal in both directions, 
boththe sense (S) and antisense (A) identities are given. 

for several reasons: (1) the kinetics of ligation and 
transformation favour the isolation of smaller 
PCR products, thereby producing a misrepresen- 
tation of larger gene products; (2) northern blot 
analysis is notoriously insensitive and is unlikely 
to confirm expression of rare transcripts; (3) there 
is no measurable end point to the screening of 
clones produced in this way other than to analyse 
every transformed colony. We used instead an 
alternative approach; after running out the differ- 
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ential display on a high-resolution agarose gel 
(Fig. 1) and overstaining with SYBR Green I to 
enhance visualisation, the composite bands were 
individually extracted, reamplified and cloned. 
However, it has been well documented that single 
bands from differential displays often contain a 
heterogeneous mixture of different products 
(Mathieu-Daude et al., 1996; Smith et al., 1997). 
This is because polyacrylamide gels cannot dis- 
criminate between DNA sequences that differ in 
size by less than about 0.2% (Sambrook et al., 
1989). High-resolution agarose gels such as those 
used in this work are even less sensitive, normally 
only discriminating products that differ in size by 
at least 1.5%. The use of the HA-red screening 
>tep enables resolution of identical or nearly iden- 
:ical sequences based on their AT content (Wawer 
?t al M 1995) and is sensitive down to < 1% differ- 
ence. Furthermore, it is rapid, technically simple 
ind does not require the use of radiolabels. 
3eisinger et al. (1997) originally demonstrated the 
jsefulness of using HA-red to identify different 
products cloned from the same band of an RNA 
differential display experiment by simultaneously 
unning them in normal agarose (to discriminate 
?y size) and in normal agarose containing HA-red 
to discriminate by AT content). We have found 
hat this approach is equally useful for identifying 
lifferent gene species cloned from the same band 
>f our SSH display. 

Diatchenko et al. (1996) reported that SSH is 
lighly efficient at producing differentially ex- 
>ressed gene species. However, we also included a 
econd screening step to further confirm that the 
lones isolated from the differential display were 
ndeed differentially expressed. Duplicate dotblots 
the candidate clones were blotted with the 
iisplay from which they were originally isolated 
-nd with the 'reverse subtraction' display. To 
nake the reverse-subtracted probe, the subtractive 
ybridisation step of the procedure was carried 
■ut using the original tester cDNA as a driver, 
nd the original driver cDNA as a tester. In this 
/ay, clones that are false positives can be iden- 
fied through their presence in both blots. Such 
alse positives most commonly arise through hav- 
lg a very high abundance in the initial sample or 
nusual hybridisation properties (Li et al., 1994). 



Although the SSH method itself has been 
shown to be efficient, and despite the screening 
step that we included, there is an important caveat 
to bear in mind — namely that it is important 
that all clones be considered only as 'candidates' 
until the actual abundance of their mRNA is 
quantitated in treated and control samples. To- 
wards this end, we examined the expression of a 
limited number of clones using semi-quantitative 
RT-PCR. Albumin was used as the reference gene 
as we have previously found that the expression 
of this gene does not appear to change with the 
treatment regime that we used (Fig. 4, and data 
not shown). There are a number of interesting 
points to note from our results. The first is the 
presence of genes that serve as appropriate posi- 
tive controls in the upregulated and downregu- 
lated series. For example, in the rat it can be seen 
that CYP4AI expression increases 14-fold follow- 
ing treatment. Although CYP4AI mRNA expres- 
sion levels following WY-14,643 treatment have 
not been previously reported in this model, the 
figure compares favourably with that recorded by 
Bell et al. (1991), who used RNAse-protection to 
quantitate CYP4A1 in rat liver following treat- 
ment with methylclofenapate, another PP. In ad- 
dition, we also confirmed that the peroxisomal 
enoyl-CoA:hydratase-3-hydroxyacyl-CoA Afunc- 
tional enzyme is also upregulated 9-fold, in agree- 
ment with the findings of Chen and Crane (1992). 

A number of genes were downregulated follow- 
ing Wy-14,643 exposure, including CYP2C11 ex- 
pression. Corton et al. (1997) reported similar 
findings and suggested that this may in part ex- 
plain why male rats exposed to Wy- 14,643 and 
some other PPs have high serum estradiol levels, 
as estradiol is a substrate for CYP2C1 1. We have 
also shown that the expression of contrapsin-like 
protease inhibitor (CLPI) was downregulated by 
Wy- 14,643. This has not previously been reported, 
and we suggest that it may be linked to a require- 
ment for increased availability of amino acids to 
accommodate the hepatomegaly induced by treat- 
ment. Although little is known of the function of 
parathymosin-a, (zinc 2 + -binding protein) it has 
been shown to interact with the globular domain 
of histone HI, suggesting a role in histone func- 
tion (Kondili et al., 1996). In contrast to the 
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Albumin 
Transferrin 



Albumin 
Transferrin 




Fig. 4. Semi-quantitative RT-PCR experiment showing relative decrease in expression of transferrin in treated rat liver (RT-1 to 
RT-4) compared to controls (RC-1 to RC-3). An equal amount of mRNA was used in each reaction (10 ng), and each sample was 
quantitated in triplicate to reduce the effects of inter-tube variation. N is negative control (no mRNA). Lane M is a 100 bp ladder 
and lane L is a 1 Kb DNA ladder. 



downregulation observed in this work, other stud- 
ies have shown that parathymosin-a expression is 
elevated in breast cancer (Tsitsilonis et al., 1993, 
1998), with the implication that parathymosin-a 
may somehow be involved in regulating cell pro- 
liferation by more than one mechanism. Transfer- 
rin has previously been shown to be 
downregulated in rat liver by hypolipidemic PPs 
(Hertz et al., 1996). It is therefore interesting to 
note that we isolated a clone identified as transfer- 
rin from the upregulated display profile. Since we 
confirmed by RT-PCR that transferrin is in fact 
downregulated in the rat (Fig. 4), we conclude 
that transferrin was either a false positive or was 
incorrectly identified. It could also be that we 
have isolated a close relative, splice variant or 
isofonr of transferrin, which demonstrates a dif- 
ferent expression profile under these experimental 
conditions. Further investigations are therefore 



required to determine which of these possibilities 
are correct. 

One of our most intriguing observations was 
that one gene, CD81, appeared to be upregulated 
in rat liver but downregulated in guinea pig liver 
following Wy- 14,643 exposure. CD81 is a widely 
expressed cell surface protein that is involved in a 
large number of cellular functions, including ad- 
hesion, activation, proliferation and differentia- 
tion (reviewed by Levy et al., 1998). Since all of 
these functions are altered to some extent in car- 
cinogenesis, it is perhaps an important observa- 
tion that CD81 expression is differentially 
regulated in a resistant and sensitive species ex- 
posed to a non-genotoxic carcinogen. 

Albumin and ribosomal genes appear common 
to all differential displays and are thus undesir- 
able false positives. However, due to their high 
expression in the liver, they are difficult to re- 
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move. We also noted a number of gene species, 
particularly in the guinea pig, which were com- 
mon to both upregulated and downregulated 
profiles. Again, the most likely reason for these 
having arisen is their high abundance. 

A relatively large number of upregulated and 
downregulated genes were isolated from guinea 
pig liver following Wy- 14,643 exposure. However, 
the guinea pig genome has been relatively poorly 
characterised and so many of the clones were 
identified as resembling genes or ESTs from other 
species. Without full-length sequence data it is 
difficult to ascertain the accuracy of the assigned 
^entities and this must be borne in mind when 
utilising data such as this, for example, in design- 
ng effective primers for RT-PCR studies. Al- 
hough the actual isolated clone sequences can be 
ased to do this, their relatively small size often 
estricts the ability to design effective primers. In 
iddition, as we observed with transferrin, using a 
published full-length sequence may help to iden- 
ify false positives. 



By comparing the expression profiles of genes 
showing altered expression in a PP-sensitive spe- 
cies (rat) with a PP-resistant species (guinea pig) % 
it was our aim to identify genes that are mecha- 
nistically relevant to the non-genotoxic hepatocar- 
cinogenic action of Wy-14,643. However, few of 
the genes that we have isolated were common to 
both the rat and the guinea pig. This suggests 
either that the molecular mechanisms of response 
in these two species are so different that few genes 
are commonly regulated in response to Wy-14,643 
exposure, or that we have recovered only a small 
proportion of those genes that have altered ex- 
pression. The latter seems the more likely scenario 
since it is perceived that one of the main problems 
of subtractive hybridisation and other differential 
expression technologies is the inability to consis- 
tently isolate rare gene transcripts (Bertioli et ah, 
1995). This is potentially problematic in that 
weakly expressed genes may play an important 
role in regulating key cellular processes, and that 
the majority of mRNA species are classified as 



Table 6 

iemi-quantitative RT-PCR analysis of selected gene species in the rat and guinea pig 8 



ranscnpt 



Putative change of expression following 
treatment according to dotblot 

Rat 



Change according to RT-PCR 
quantitation 



Guinea pig 


Rat 


Guinea pig 


N/A 


No change 


No change 


N/A 


Upregulated* (9x ) 


N/O 


N/A 


Downregulated* 


N/D 




(Abolished) 




N/A 


Upregulated* (14 x ) 


N/D 


Up 


No change 


N/O 


Down 


N/O 


Upregulated**(1.4 


N/A 


Downregulated** 


x ) 
N/D 




(0.5 x) 




N/A 


Downregulated** 


N/D 




(0.6 x) 




N/A 


Downregulated* 


No change 




(0.5 x) " 




N/A 


Downregulated** 


N/O 




(0.2 x) 




N/A 


No change (P = 0.06) 


N/D 


N/A 


No change 


N/O 



Ubumin 

lifunctional enzyme 
:YP2C11 

:yp4ai 

atalase 

:D8i (tapa-1) 



N/A 

Up 

Down 

Up 

N/A 

Up 



ontrapsin-like protease inhibitor Down 

arathymosin-a (zinc 2+ binding Down 
protein) 

ransferrin Up 

'DP-Glucuronosyl transferase Down 

>ownUnknown-l Down 

n-cc2-gIycoprotein Up 



a N/A, not applicable; N/O, not optimised; N/D, not done. 

* P< 0.0005; 

**P<0.05. 
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/rare' in abundance (Bertioli et al., 1995). How- 
ever, in their original paper describing the SSH 
technique, Gurskaya et al. (1996) demonstrated 
that SSH can enrich rare molecules between 1000- 
and 5000-fold in a single round of hybridisation. 
Unfortunately, due to high background smearing 
in our initial experiments (which hindered identifi- 
cation of single bands), we were compelled to 
reduce the primary hybridisation time to only 4 h 

a step that theoretically is likely to reduce the 

number of rare sequences (CLONTECHniques, 
1996). Furthermore, it has been claimed by the 
manufacturers that, whilst this technique can 
identify changes as small as 1.5-fold between the 
driver and tester populations, it is best suited to 
the isolation of genes that show a greater than 
5-fold increase (CLONTECHniques, 1996). In ad- 
dition, where tester and driver contain genes with 
large and small differences in abundance, the SSH 
method will be biased towards identifying those 
genes with the large differences (CLONTECH- 
niques, 1996). Thus, it is most probable that we 
have not isolated all of the more rarely expressed 
transcripts and those demonstrating small changes 
in expression. 

One problem that remains is identifying the 
function of genes isolated in SSH experiments as 
described herein, some of which may be crucial to 
the process of carcinogenesis, and are, to date, 
unidentified. However, we have provided evidence 
herein that SSH can be used to begin the process 
of characterising the extent and importance of 
altered gene expression in response to a chemical 
stimulus. The developments of this approach 
should include characterisation of temporal and 
dose responses, and functional analysis studies 
including knockout mice. In combination, such 
studies should make a significant contribution to 
our understanding of the molecular mechanisms 
of action and physiological relevance of gene reg- 
ulation in non-genotoxic hepatocarcinogenesis. It 
should then be. possible to ascertain whether dif- 
ferentially expressed genes are causally or casually 
related to the chemical-induced toxicity, and 
therefore a substantial mechanistic advance. 

It is clear that there are also broader applica- 
tions for this experimental approach that go be- 
yond understanding the molecular mechanisms of 
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peroxisome-proliferator induced non-genotoxic 
hepatocarcinogenesis in rodents. The potential 
medical and therapeutic benefits of elucidating the 
molecular changes that occur in any given cell in 
progressing from the normal to the carcinogenic 
(or other diseased, abnormal or developmental) 
state are very substantial. Notwithstanding the 
lack of complete functional identification of al- 
tered gene expression, such gene profiling studies 
described herein essentially provides a 'fingerprint' 
of each stage of carcinogenesis, and should help in 
the elucidation of specific and sensitive biomark- 
ers for different types of cancer. Amongst other 
benefits, such fingerprints and biomarkers could 
help uncover differences in histologically identical 
cancers, and provide diagnostic tests for the earli- 
est stages of neoplasia. In addition, the genes 
identified by this approach may be incorporated 
into gene-chip DNA-arrays, thus providing a 
standard genetic fingerprint for a particular toxin 
treatment in a particular species. Interrogation of 
these gene arrays for an unknown compound that 
has a similar pattern to the known reference 
chemical would then provide evidence that the 
unknown may have a toxicity profile similar to 
the 'standard' fingerprint, thereby serving as a 
mechanistically relevant platform for further de- 
tailed investigations. 



Acknowledgements 

This work was funded by Rhone-Poulenc Agro- 
chemicals, Nice, France. 



References 

Anderson, N.L., Esquer-Blasco, R., Richardson, F. t Foxwor- 
thy, P., Eacho, P., 1996. The effects of peroxisome prolifer- 
ators on protein abundances in mouse liver. Toxicol. Appl. 
Pharmacol. 137, 75-89. 

Ashby, J., 1992. Prediction of non-genotoxic carcinogenesis. 
Toxicol. Lett. 64-65, 605-612. 

Bell, DR., Bars, R.G., Gibson, G.G., Elcombe, C.R., 1991. 
Localisation and differential induction of cytochrome 
P450IVA and acyl coA oxidase in rat liver. Biochem. J. 
275, 247-252. 

Bertioli, D.J., Schlichter, U.H.A., Adams, MJ., Burrows, 
P.R., Steinbiss, H.-H., Antoniw, J.F., 1995. An analysis of 



'28 



J.C. Rockett et al. / Toxicology 144 (2000) 13-29 



differential display shows a strong bias towards high copy 
number mRNAs. Nucleic Acid Res. 23 (21), 4520-4523. 
Cattley, R.C, Rato, M., Popp, J.A., Teets, V.J., Voss, R.S., 
1994. Initiator-specific promotion of hepatocarcinogenesis 
by Wy-14,643 and clofibrate, Carcinogenesis 15 (8), 1763- 
1766. 

Chen, N., Crane, D.I., 1992. Induction of the major integral 
membrane protein of mouse liver peroxisomes by perox- 
isome proliferators. Biochem J. 283, 605-610. 
CLONTECHniques, 1996. Technical Tips: Clontech PCR-Se- 

lect cDNA Subtraction, October 25, application notes. 
CLONTECHniques, 1997. PCR-Select Differential Screening 
Kit — The Next Step After Clontech PCR-Select cDNA 
Subtraction. XII(2), 18-19, application notes. 
Corton, J.C., Bocos, C, Moreno, E.S., Merrit, A., Cattley, 
R.C, Gustaffson, J.A., 1997. Peroxisome proliferators al- 
ter the expression of estrogen-metabolising enzymes. 
Biochimie 79, 151-162. 
Davis, M., Cohen, D.I., Nielson, E.A., Steinmetz, M, Paul, 
W.E., Hood, L., 1984. Cell -type-specific cDNA probes and 
the murine I region: the localisation and orientation of 
Ad/a. Proc. Natl. Acad. Sci USA 81, 2194-2198. 
Diatchenko, L., Lau, Y.-F.C, Campbell, A.P., Chenchik, A., 
Moqadam, F., Huang, B., Lukyanov, K., Gurskaya, N., 
Sverdlov, E.D., Siebert, P.D, 1996. Suppression sub'trac- 
tive hybridisation: a method for generating differentially 
regulated or tissue-specific cDNA probes and libraries. 
Proc. Natl. Acad. Sci. USA 93, 6025-6030. 
Duguid, J., Dinauer, M., 1990. Library subtraction of in vitro 
cDNA libraries to identify differentially expressed genes in 
scrapie infection. Nucleic Acid Res. 18 (9), 2789-2792. 
Geisinger, A., Rodriguez, R., Romero, V., Wettstein, R., 1997. 
A simple method for screening cDNAs arising from the 
cloning of RNA differential display bands. Elsevier trends 
journals technical tips online, http://tto.trends.com, docu- 
ment number T01110 
Guimaraes, M.J., Lee, F., Zlotnik, A., McClanahan, T., 1995. 
Differential display by PCR: novel findings and applica- 
tions. Nucleic Acid Res. 23 (10), 1832-1833. 
Gurskaya, N.G., Diatchenko, L., Chenchik, P.D., Siebert, 
P.D., Khaspekov, G.L., Lukyanov, K.A., Vagner, L.L., 
Ermolaeva, O.D., Lukyanov, S.A., Sverdlov, E.D., 1996. 
Equalising cDNA subtraction based on selective suppres- 
sion of polymerase chain reaction: cloning of Jurkat cell 
transcripts induced by phytohemaglutinin and phorbol 12- 
myrystate 13-acetate. Anal. Biochem. 240, 90-97. 
Hayashi, F., Tamura, H., Yamada, J., Rasai, H., Suga, T., 
1994. Characteristics of the hepatocarcinogenesis caused by 
dehydroepiandrosterone, a peroxisome proliferator, in 
male F-344 rats. Carcinogenesis 15 (190), 2215-2219. 
Hedrick, S.M., Cohen, D.I., Nielsen, E.A., Davis, MM., 1984. 
Isolation of cDNA clones encoding T cell-specific mem- 
brane-associated proteins. Nature 308 (8), 149-153. 
Hertz, R., Seckbach, M, Zakin, M.M., Bar-Tana, J., 1996. 
Transcriptional suppression of the transferin gene by hy- 
polipidemic peroxisome proliferators. J. Biol Chem 271 
(1), 218-224. 



Hubank, M., Schatz, D.G., 1994. Identifying differences in 
mRNA expression by representational, difference analysis 
Nucleic Acid Res. 22 (25), 5640-3648. 

Human and Experimental Toxicology, 1994. Hum. Exp. Toxi- 
col. 13 (Suppl. 2) (entire issue). 

Rolls, J., Dsininger, P., Cohen, C, Larson, J., 1993. cDNA 
equalisation for reverse transcription-polymerase chain re- 
action quantitation. Anal. Biochem 208, 264-269. 

Rondih\ R., Tsoias, O., Papamarcaki, T., 1996. Selective 
interaction between parathymosin and histone HI. Eur J 
Biochem. 242 (1), 67-74. 

Lake, B.G., Evans, J.G., Gray, T.J.B., Rorosi, S.A., North, 
C.J., 1989. Comparative studies of nafenopin-induced her> 
atic peroxisome proliferation in the rat, Syrian hamster, 
guiea pig and marmoset. Toxicol. Appl. Pharmacol 99* 
148-160. 

Lake, B.G., Evans, J.G., Cunninghame, M.E., Price, R.J., 

1993. Comparison of the hepatic effects of Wy-14,643 on 
peroxisome proliferation and cell replication in the rat and 
Syrian hamster. Environ. Health Perspect. 101 (S5), 241- 
248. 

Levy, S., Todd, S.C., Maecker, H.T., 1998. CD81 (TAPA-1): a 
molecule involved in signal transduction and cell adhesion 
in the immune system. Annu. Rev. Immunol. 16, 89-109. 

Li, W.B., Gruber, C.E., Lin, J.J., D'Alessio, J.M., Jessee, J.A., 

1994. The isolation of differentially expressed genes in 
fibroblast growth factor stimulated BC3H1 cells by sub- 
tractive hybridization. BioTechniques 16, 722-729. 

Liang, P., Pardee, A.B., 1992. Differential display of eukary- 
otic messenger RNA by means of the polymerase chain 
reaction. Science 257 (5072), 967-971. 

Liang, P., Averboukh, L., Reyomarsi, R., Sager, R., Pardee, 
A.B., 1992. Differential display and cloning of messenger 
RNAs from human breast cancer versus mammary epithe- 
lium. Cancer Res. 52, 6966-6968. 

Lipman, D.J., Pearson, W.R., 1985. Rapid and sensitive 
protein similarity searches. Science 227, 1435-1441. 

Makowska, J.M., Gibson, G.G., Bonner, F.W., 1992. Species 
differences in ciprofibrate induction of hepaic cytochrome 
P450IVA1 and peroxisome proliferation. J. Biochem. Toxi- 
col. 7, 183-191. 

Marsman, D.S., Cattley, R.C, Conway, J.G., Popp, J.A., 
1988. Relationship of hepatic peroxisome proliferation and 
replicative DNA synthesis to the hepatocarcinogenicity of 
the peroxisome proliferators di-(2-ethylhexyl)phthalate and 
[4-chIoro-6-(2,3-xylidino)-2-pyrimidinylthio]acetic acid 
(Wy-14,643) in rats. Cancer Res. 48, 6739-6744. 
Mathieu-Daude, F., Cheng, R., Welsh, J., McClelland, M., 
1996. Screening of differentially amplified cDNA products 
from RNA arbitrarily primed PCR fingerprints using sin- 
gle strand conformation polymorphism (SSCP) gels. Nu- 
cleic Acid Res. 24 (8), 1504-1507. 
Orton, T.C, Adam, H.R., Bentley, M., Holloway, B., Tucker, 
M.J., 1984. Clobuzarit: species differences in the morpho- 
logical and biochemical response of the liver following 
chronic administration. Toxicol. Appl. Pharmacol 73 
138-151. 



J.C Rocket t et al/1 

* 

Parodi, S., 1992. Non-genotoxic factors in the carcinogenic 
process: problems of detection and hazard evaluation. 
Toxicol. Lett. 64-65, 621-630. 

Pearson, W.R., Lipman, D.J., 1988. Imported tools for biolog- 
ical sequence comparison. Proc. Natl. Acad. Sci. USA 85, 
2444-2448. 

Rockett, J.C., Esdaile, D.J., Gibson, G.G., 1997. Molecular 
profiling of non-genotoxic hepatocarcinogenesis using dif- 
ferential display reverse transcription-polymerase chain re- 
action (ddRT-PCR). Eur. J. Drug. Metab. Pharmacokinet 
22 (4), 329-333. 

Rodricks, J.V., Turnbull, D., 1987. Inter-species differences in 
peroxisomes and peroxisome proliferation. Toxicol. Ind. 
Health 3, 197-212. 

Sambrook, J., Fritsch, E.F., Maniatis, T., 1989. In: Ford, N M 
Nolan, C, Ferguson, M. (Eds.), Molecular Cloning — A 
Laboratory Manual, second ed. Cold Spring Harbor Labo- 
ratory Press, New York. 

Sargent, T., Dawid, I., 1983. Differential gene expression in 
the gastrula of Xenopus laevis. Science 222, 135-139. 

Smith, . N.R., Li, A., Aldersley, M., High, A.s., Markham, 
A.F., Robinson, P.A., 1997. Rapid determination of the 



ogy 144 (2000) 13-29 29 

complexity of cDNA bands extracted from DDRT-PCR 
polyacrylamide gels. Nucleic Acid Res. 25 (17), 3552-3554. 
Tsitsilonis, O.E., Stiakakis, J M Koutselinis, A., Gogas, J., 
Markopoulos, C, Yialouris, P., Bekris, S. t Panoussopou- 
los, D., Kiortsis, V., Voelten W., Haritos, A.A., 1993. 
Expression of alpha-thymosins in human tissues in normal 
and abnormal growth. Proc. Natl. Acad. Sci. USA 90 (20), 
9504-9507. 

Tsitsilonis, O.E., Bekris, E., Voutsas, I.F., Baxevanis, C.N., 
Markopoulos, C, Papadopoulou, S.A., Kontzoglou, K., 
Stoeva, S., Gogas, J., Voelter, W., Papamichail, M., 1998. 
The prognostic value of alpha-thymosins in breast cancer. 
Anticancer Res. 18 (3A), 1501-1508. 

Wan, J.S., Sharp, S.J., Poirier, G.M.-C, Wagaman, P.C., 
Chambers, J., Pyati, J., Horn, Y.-L., Galindo, J.E., Huvar, 
A., Peterson, P.A., Jackson, M.R., Erlander, M.G.. 1996. 
Cloning differentially expressed mRNAs. Nat. Biotechnol. 
14, 1685-1691. 

Wawer, C, Ruggeberg, H., Meyer, G., Muyzer, G., 1995. A 
simple and rapid electrophoresis method to detect sequence 
variation in PCR-amplified DNA fragments. Nucleic Acid 
Res. 23 (23), 4928-4929. 




aii international journal concerae#with 
the effect^ of chemic livmg system^ 
and immunotoxicologyj W | 





ELSEVIER 



Univ. of Minn. * 
Bio-Medical 
Library 

05 05 00 



Special Issue 
Festschrift dedicated to Professor Dr. K.J. Netter 



Exhibit F of Rockett Declaration i 
with Response dated 1 1/1 8/03 j 
In USSN: 09/867,958 ' 
PC-0041 CIP 

Proc NatL Acad. ScL USA , 
Vol. 94, pp. 13057-13062, November 1997 j 
Genetics 

Yeast microarrays for genome wide parallel genetic and gene 
expression analysis 

Deval A. LASHKARi't, Joseph L. DeRisi*, John H. McCusker§, Allen F. Namath*, Cristl Gentile§, 
Seung Y. Hwang*, Patrick 0. Brown*, and Ronald W. Davis** 

Departments of 'Genetics and ^Biochemistry, Stanford University, Stanford, CA 94305; and 5 Department of Microbiology, Duke University, Durham, NC 27710 
Contributed by Ronald W. Davis, September 2, 1997 



ABSTRACT We have developed high-density DNA mi- 
croarrays of yeast ORFs. These microarrays can monitor 
hybridization to ORFs for applications such as quantitative 
differential gene expression analysis and screening for se- 
quence polymorphisms. Automated scripts retrieved sequence 
information from public databases to locate predicted ORFs 
and select appropriate primers for amplification. The primers 
were used to amplify yeast ORFs in 96-well plates, and the 
resulting products were arrayed using an automated micro 
arraying device. Arrays containing up to 2,479 yeast ORFs 
were printed on a single slide, The hybridization of fluores- 
cently labeled samples to the array were detected and quan- 
titated with a laser confocal scanning microscope. Applica- 
tions of the microarrays are shown for genetic and gene 
expression analysis at tbe whole genome level. 



The genome sequencing projects have generated and will con- 
tinue to generate enormous amounts of sequence data. The 
genomes of Saccharomyces cerevisiae, Haemophilus influenzae (1), 
Mycoplasma genitalium (2), and Methanococcus jannischu (3) 
have been completely sequenced. Other model organisms have 
had substantial portions of their genomes sequenced as well 
including the nematode Caenorhabduis elegans (4) and the small 
flowering plant Arabidopsis thaliana (5). Given this ever- 
increasing amount of sequence information, new strategies are 
necessary to efficiently pursue the next phase of the genome 
projects— the elucidation of gene expression patterns and gene 
product function on a whole genome scale. 

One important use of genome sequence data is to attempt 
to identify the functions of predicted ORFs within the genome. 
Many f the ORFs identified in the yeast genome sequence 
were not identified in decades of genetic studies and have no 
significant homology to previously identified sequences in the 
database. In addition, even in cases where ORFs have signif- 
icant homology to sequences in the database, or have known 
sequence motifs (e.g., protein kinase), this is not sufficient to 
determine the actual biological role of the gene product. 
Experimental analysis must be performed to thoroughly un- 
derstand the biological function of a given ORFs product. 
Model organisms, such as S. cerevisiae, will be extremely 
important in improving our understanding of other more 
complex and less manipulable organisms. 

To examine in detail the functional role of individual ORFs and 
relationships between genes at the expression level, this work 
describes the use of genome sequence information to study large 
numbers f genes efficiently and systematically. The procedure 
was as follows, (i) Software scripts scanned annotated sequence 
information from public databases f r predicted ORFs. (it) The 
start and st p position f each identified ORF was extracted 
automatically, along with the sequence data of the ORF and 200 
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bases flanking either side, (iit) These data were used to automat- 
ically select PCR primers that would amplify the ORF. (rv) The 
primer sequences were automatically input into tbe automated 
multiplex oligonucleotide synthesizer (6). (v) The oligonucleo- 
tides were synthesized in 96-well format, and (vi) used in 96-well 
format to amplify the desired ORFs from a genomic DNA 
template, (vii) The products were arrayed using a high-density 
DNA arrayer (7-10). The gene arrays can be used for hybridiza- 
tion with a variety of labeled products such as cDNA for gene 
expression analysis or genomic DNA for strain comparisons, and 
genomic mismatch scanning purified DNA for genotyping (11). 

METHODS 

Script Design. All scripts were written in UNIX Tool Command 
Language. Annotated sequence information from GenBank was 
extracted into one file containing the complete nucleotide se- 
quence of a single chromosome. A second file contained the 
assigned ORF name followed by the start and stop positions of that 
ORF. The actual sequence contained within the specified range, 
along with 200 bases of sequence flanking both sides, was extracted 
and input into the primer selection program primer 0-5 (White- 
head Institute, Boston). Primers were designed so as to allow 
amplification of entire ORFs. The selected primer sequences were 
read by the 96-well automated multiplex oligonucleotide synthe- 
sizer instrument for primer synthesis. The forward and reverse 
primers were synthesized in two separate 96-well plates in corre- 
sponding wells. All primers were synthesized on a 20-nmol scale. 

ORF Amplification and Purification. Genomic DNA was iso- 
lated as described (12) and used as template for the amplification 
reactions. Each PCR was done in a total volume of 100 uL A total 
of 0.2 uM each of forward and reverse primers were aliquoted into 
a 96-well PCR plate (Robbins Scientific, Sunnyvale, CA); a master 
mix containing 0.24 mM each dNTP, 10 mM Tris (pH 8.5), 50 mM 
MgCl 2 , 25 units Taq polymerase, and 10 ng of template was added 
to the primers, and the entire mix was thermal cycled for 30 cycles 
as follows: 15 min at 94°Q 15 min at 54°Q and 30 min at 72°C 
Products were ethanol precipitated in polystyrene v-bottom 96- 
well plates (Costar). All samples were dried and stored at -20°C 

Arraying Procedure and Processing. Microarrays were 
made as described (8). 

A custom built arraying robot was used to print batches of 48 
slides. The robot utilizes four printing tips which simultaneously 
pick up -*1 u\ of solution from 96-well microliter plates. After 
printing, the microarrays were rehyd rated for 30 sec in a humid 
chamber and then snap dried for 2 sec on a hot plate (100°C). The 
DNA was then UV crosslinked to the surface by subjecting the 
slides to 60 millijoules of energy. The rest of the pory-L-lysine 
surface was blocked by a 15-min incubation in a solution of 70 mM 
succinic anhydride dissolved in a solution consisting of 315 ml of 
l-methyl-2-pyrrolidinone (Aldrich) and 35 ml of 1 M boric acid 
(pH 8.0). Directly after the blocking reaction, the bound DNA 
was denatured by a 2-min incubation in distilled water at ~95°C. 



Abbreviation: YEP, yeast extract/peptone. 
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Fig. 1. Two-color fluorescent scan of a yeast microarray contain- 
ing 2,479 elements (ORFs). The centcr-to-center distance between 
elements is 345 pm. A probe mixture consisting of cDNA from yeast 
extract/peptone (YEP) galactose (green pseudocolor) and YEP glu- 
cose (red pseudocolor) grown yeast cultures was hybridized to the 
array. Intensity per element corresponds to ORF expression, and 
pseudocolor per element corresponds to relative ORF expression 
between the two cultures. 

The slides were then transferred into a bath of 100% ethanoJ at 
room temperature. 

Probe Preparation: cDNA. Yeast cultures (100 ml) were grown 
to ~1 OD A 6oo and total RNA was isolated as described (13). Up 
to 500 ii% total RNA was used to isolate mRNA (Qiagen, 
Chatsworth, CA). Oligo(dT)20 (5 Mg) was added and annealed to 
2 ng of mRNA by heating the reaction to 70°C for 10 min and 
quick chilling on ice, plus 2 /il Superscript II (200 units//d) (Life 
Technologies, Gaithersburg, MD), 0.6 /d 50 X dNTP mix (final 
concen trations were 500 jiM dATP, dCTP, dGTP, and 200 jiM 
dTTP), 6 /J 5x reaction buffer, and 60 jiM Q3-dUTP or 
Cy5-dUTP (Amersham). Reactions were carried out at 42°C for 
2 h, after which the mRNA was degraded by the addition of 0.3 
Ml 5 M NaOH and 03 jd 100 mM EDTA and heating to 65°C for 
10 min. The sample was then diluted to 500 pd with TE and 
concentrated using a Microcon-30 (Amicon) to 10 /il. 

Probe Preparation: Genomic DNA. Fluorescent DNA was 
prepared from total genomic DNA as follows: 1 of random 
nonamer oligonucleotides was added to 2.5 fig of genomic 
DNA. This mixture was boiled for 2 min and then chilled on 
ice. A reaction mixture containing dNTPs (25 dATP 
dCTP, dGTP, 10 /iM dTTP, and 40 /iM Cy3-dUTP or 
Cy5-dUTP) reaction buffer (New England Biolabs), and 20 
units exonuclease free Klenow enzyme (United States Bio- 
chemical) was added, and the reaction was incubated at 37°C 
for 2 h. The sample was then diluted to 500 j&] with TE and 
concentrated using a Microcon-30 (Amicon) to 10 pj. 

Hybridization. Purified, labeled probe was resuspended in 1 1 
/d of 35 x SSC containing 10 ^Escherichia coli tRNA, and 0.3% 
SDS. The sample was then heated for 2 min in boiling water, 
cooled rapidly to room temperature, and applied to the array. The 
array was placed in a sealed, humidified, hybridization chamber. 
Hybridization was carried out for 10 h in a 62°C water bath, after 
which the arrays were washed immediately in 2x SSC/0.2% SDS. 
A second wash was performed in 0.1 X SSC 

Analysis and Quantitation. Arrays were scanned on a 
scanning laser fluorescence microscope developed by Steve 
Smith with software written by Noam Ziv (Stanford Univer- 



sity). A separate scan was done for ach of the two fluoro- 
phores used. The images were then combined for analysis, A 
bounding box, fined to the size of the DNA spots, was placed 
over each array element. The average fluorescent intensity was 
calculated by summing the intensities of each pixel present in 
a bounding box and then dividing by the total number of pixels. 
Local area background was calculated for each array element 
by determining the average fluorescent intensity at the edge of 
the bounding box. To normalize for f luorophore-specific vari- 
ation, control spots containing yeast genomic DNA were 
applied to each quadrant during the arraying process. These 
elements were quantitated and the ratios of the signals were 
determined. These ratios were then used to normalize the 
photomultiplier sensitivity settings such that the ratios of the 
fluorescence of the genomic DNA spots were close to a value 
of 1.0. The average signal intensity at any given spot was 
regarded as significant if it was at least two standard deviations 
above background. Each experiment was conducted in dupli- 
cate, with the fluorophores representing each channel re- 
versed. The ratios presented here are the average of the two 
experiments, except in the case in which the signal for the 
element in question was below the reliability threshold. The 
reliability threshold also determined the dynamic range of the 
experiment. For all of the experiments presented, the average 
dynamic range was M to 100. In the case where the fluores- 
cence from a very bright spot saturates the detector, differ- 
ential ratios will, in general, be underestimated. This can be 
compensated for by scanning at a lower overall sensitivity. 

RESULTS 

The accumulation of sequence information from model organ- 
isms presents an enormous opportunity and challenge to under- 
stand the biological function of many previously uncharacterized 
genes. To do this accurately and efficiently, a directed strategy 
was developed that enables the monitoring of multiple genes 
simultaneously. Microarraying technology provides a method by 
which DNA can be attached to a glass surface in a high-density 
format (8). In practice, it is possible to array over 6,000 elements 
in an area less than 1.8 cm 2 . Given that the yeast genome consists 
of ~6,100 ORFs, the enure set of yeast genes can be spotted onto 
a single glass slide. 

With this capability and the availability of the entire se- 
quence of the yeast genome, our strategy was to use a directed 
approach for generating the complete genome array. This 
procedure involved synthesizing a pair of oligonucleotide 
primers to amplify each ORF. The PCR product containing 
each gene of interest was arrayed onto glass and used, for 
example, as probe for monitoring gene expression levels by 
hybridizing to the array labeled cDNA generated from isolated 
mRNA of a culture grown under any experimental condition. 

Primer Selection and Synthesis. The primer selection was fully 
automated using Tool Command Language scripts and primer 
0.5. (Whitehead). Primer pairs were automatically selected suc- 
cessfuDy for >99% of the ORFs tested. Primer sequences can thus 
be selected rapidly with minimal manual processing. A complete 
set of forward and reverse primers were selected initially for each 
ORF on chromosomes I, II, III, V, VI, VIII, IX, X, and XI. 
Primers for a representative set of ORFs (15% coverage) were 
chosen for the remaining chromosomes. With the release of the 
entire yeast genome sequence, the complete set of primers has 
now been selected. 

Because each ORF requires a unique pair of synthetic primers, 
a total of approximately 12,200 oligonucleotides will be required 
to individually amplify each target. This costly component was 
addressed with the automated multiplex oligonucleotide synthe- 
sizer (6) which efficiently synthesizes primers in a 96-welI format 
Each primer, synthesized on a 20-nmoJ scale, provides enough 
material for 100 amplification reactions, whereas a given PCR 
product provides enough material to generate an element on 
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Table 1. Heat shock vs. control expression data 
Ratio of 



gene expression 



Control 


Neat 


ORF 


Gene 




2JZ 


YLR142 


PUT1 




2.0 


YOL140 


ARG8 


23 




YCL148 


AR02 




36.0 


YFL014 


HSP12 




27.4 


YBR072 


HSP26 




6.7 


YBRQ54 


YR02 




3.4 


YCR021 


HSP30 




2.6 


YER103 


SSA4 




23 


YLR259 


HSP60 




2.1 


YBR169 


SSE2 




1.7 


YBL075 


SSA3 




1.4 


YPL240 


HSP82 




1.4 


YDR258 


HSP78 


1.0 




YNL007 


SIS1 


1.1 




YEL030 




1.9 




YHR064 






1.3 


YBL0Q8 


HIR1 


2.6 




YBL002 


HTB2 


33 




YBL003 


HTA2 


3.3 




YBR010 


HHT1 


3.9 




YBR009 


HHF1 




2.4 


YDR343 


HXT6 




2.1 


YHR092 


HXT4 


3.6 




YAR071 


PHOll 




23 


YLR096 


KIN2 


2.5 




YER102 


RPS8B 


2.6 




YBR181 


RPS101 


2.6 




YCR031 


CRY1 


2.7 




YLR441 


RP10A 


2.8 




YHR141 


RPL41B 


2.8 




YBL072 


RPS8A 


2.8 




YHL015 


URP2 


2.8 




YBR191 


URP1 


3.1 




YLR340 


RPLAO 


3.3 


5.8 


YGL123 
YLR194 


SUP44 



Description 



Proline oxidase — ^— — — 
Acetylornithine aminotransferase 
Chorismatc synthase 
Heat shock protein 
Heat shock protein 

SimUarity to HSP30 heat shock protein Yrolp 
Heat shock protein 
Heat shock protein 

Mitochondrial heat shock protein HSP60 
Heat shock protein of the HSP70 family 
Cytoplasmic heat shock protein 
Heat shock protein 

Mitochondrial heat shock protein of clpb family of ATP-dependcnt proteases 

Heat shock protein 

70-kDa heat shock protein 

Heat shock protein 

Historic transcription regulator 

Historic H2BJ2 

Histonc H2A.2 

Histone H3 

Histone H4 

High-affinity hexose transporter 

Moderate- to low-affinity glucose transporter 

Secreted acid phosphatase, 56 kDa isozyme 

Ser/Thr protein kinase 

Ribosomal protein S8.e 

Ribosomai protein S6.e 
40S ribosomal protein Sl4.e 
Ribosomal protein S3.a.e 
Ribosomai protein L36a.e 
Ribosomal protein S8.e 
Ribosomal protein 
Ribosomai protein L21.e 
Acidic Ribosomal protein L10.e 
Ribosomal protein 
Hypothetical protein 



500-1,000 arrays. Thus, a single primer pair provides enough 
starting materia] for up to *»50,000 arrays. 

Primers were synthesized to amplify yeast ORFs. Primer 
synthesis had a failure rate of <\% in over 18 plates of 
synthesis as determined by standard trityl analysis (6). The 
success rate of the PCR amplifications using the primer pairs 
was 94% based on agarose gel analysis of each PCR. The 
purified PCR products were used to generate arrays. Two 
versions f the arrays were created for the experimental results 
presented here. The first array contained 2,287 elements and 
the sec nd array batch contained 2,479 elements. 

Genome Arrays. Hie amplified ORFs were arrayed onto glass 
at a spacing of 345 microns (Fig. 1). The high-density spacing of 
DNA samples allows the hybridization volumes to be mini- 
mized—volumes are a maximum of 10 fd. The labeled probe can 
thus be maintained at relatively high concentrations, making 1-2 
Hg of mRNA sufficient for analysis. This also obviates the need 
for a subsequent amplification step and thus avoids the risk of 
altering the relative ratios of different cDNA species in the 
sample. 

Genetic Analysis: Genomic Comparison of Unrelated Strains. 
Microarrays allow efficient comparison f the genomes of dif- 
ferent strains. Genomic DNA from Y55, an S. cerevisiae strain 
divergent from the reference strain S288c, was randomly labeled 
with Cy3-dUTP and hybridized simultaneously with the S288c 
DNA labeled with Cy5-dUTP. When a comparison between the 
hybridization of the DNA from the two strains was done, several 



elements gave relatively little or no signal above background from 
the Cy3 channel (data not shown). These include SGE1 
ASP3A-D, YLR156, YLR159, YLR161, ENA2 (YDR039 is' 
ENA2), and YCR105. These results imply that the regions 
containing these genes are extremely divergent, or all together 
deleted from the strain. Subsequent attempts to generate PCR 
products from SGE1, EN A 2, and ASP3A using Y55 DNA failed. 
This result supports the conclusion that these genes are likely to 
be missing from the Y55 genome. It is interesting to note that at 
least two of the regions absent in the Y55 genome have been 
previously shown or suggested to be deleted in mutant laboratory 
strains (14-16). In particular, the Asp-3 region appears to be 
highly prone to being deleted (15, 16). 

These results indicate that gene arrays can be used to efficiently 
screen different strains of an organism for large deletion poly- 
morphisms. A single hybridization and scan will reveal differences 
based on differential hybridization to particular elements. It is 
reasonable to suppose that an equivalent number of genes are 
present in the Y55 genome and absent in the S288c genome. This 
result should be viewed as a minimum estimate f the deletion 
polymorphisms that exist between these two unrelated strains as 
intergenic deletions or small intragenic deletions would n t be 
detected because considerable hybridizing material would be 
remain. Sequence polymorphisms, such as deletions, are present 
in populations of every species and must at some level affect 
phenotype. One of the challenges of the genome era will be to 
critically examine sequence polymorphisms that exist in the 
natural gene pool relative to the reference genome sequence. 
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Gene Expression Analysis. The arrays were used to examine 
gene expression in yeast grown under a variety of different 
conditions. Expression analysis is an idea] application of these 
arrays because a single hybridization provides quantitative expres- 

Tablc 2. Cold shock vs. control expression data 



Fig. 2. ORF categories displaying dif- 
ferential expression between heat shocked 
and untreated cultures. Ban within cate- 
gories correspond to individual ORFs. 
Green shaded bars correspond to relative 
increases in ORF expression under 25*C 
growth conditions. Red shaded bars cor- 
respond to relative increases in ORF ex- 
pression under 39°C growth conditions. 

sion data for thousands of genes. To better understand results for 
genes of known function, ORFs were placed in biologically rele- 
vant categories on the basis of function (e.g., amino acid catabolic 
genes) and/or pathways (e.g., the histidine biosynthesis pathway). 



Ratio of 
gene expression 



Control 


Cold 


ORF 


Gene 




33 


YOR153 


PDR5 


2.4 




YCR012 


PGK1 


2.9 


1.4 


YCL040 
YHR064 


GLK1 


2.0 




YJL034 


KAR2 


2.1 




YDR258 


HSP78 


22 




YLL039 


UBI4 


2.7 




YLL026 


HSP104 


3.1 




YER103 


SSA4 


33 




YBR126 


TPS1 


3.8 




YPL240 


HSP82 


7.9 




YBR054 


YR02 


7.9 




YBR072 


HSP26 


16J 




YCR021 


HSP30 


1.8 




YDR343 


HXT6 


2.1 




YHR096 


HXT5 


2.4 




YFR053 


HXK1 


2.8 




YHR092 


HXT4 


3.4 




YHR094 


HXT1 




23 


YHR089 


GAR1 




1.7 


YLR048 


NAB1B 




1.7 


YLR441 


RP10A 




1.7 


YLL045 


RPL4B 




1.6 


YLR029 


RPL13A 




1.6 


YGL123 


SUP44 




3.1 


YBR067 


TIP1 




2.2 _ 


YER011 


TIR1 




2.0 


YCR058 






4.2 


YKL102 





Description 



Plciotropic drug resistance protein 
Phosphoglycerate kinase 
Aldohexose specific glucokinase 
Heat shock protein 
Nuclear fusion protein 

Mitochondrial heat shock protein of clpb family of ATP-dependent proteases 
Ubiquitin precursor 
Heat shock protein 
Heat shock protein 

a, a-Trchalosc-phosphatc synthase (UDP-forming) 
Heat shock protein 

Similarity to HSP30 heat shock protein Yrolp 

Heat shock protein 

Heat shock protein 

High-affinity hexose transporter 

Putative hexose transporter 

Hexokinasc I 

Moderate- to low-affinity glucose transporter 
Low-affinity hexose (glucose) transporter 
Nucleolar rRNA processing protein 
40S ri bosom a! protein p40 homolog b 
Ribosomal protein S3a.c 
Ribosomal protein L7a.e.B 
Ribosomal protein L15.e 
Ribosomal protein 

Cold- and heat-shock-induced protein of the Srpl/Tiplp family 
Cold-shock-induced protein of the Tirlp, Tip lp family 
Hypothetical protein 
Hypothetical protein 
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Table 3. Glucose vs. galactose expression data 



Ratio of 
gene expression 



Glucose 


oaiaciosc 


ORF 




Description 


2.1 




YHR018 


ARG4 


Arginosucrinate lyase 


3.5 




VPR035 


GLN1 


Glutamate-ammonia ligase 


2.8 




YML116 


ATR1 


Aminotriazole and 4-nitroquinoline resistance protein 


2.0 




YMR303 


ADH2 


Alcohol dehydrogenase II 


3.7 




YBR145 


ADH5 


Alcohol dehydrogenase V 




3.2 


YBL030 


AAC2 


ADP, ATP carrier protein 2 




2.9 


YBR085 


AAC3 


ADP, ATP carrier protein 




2.7 


YDR298 


ATP5 


H + -transporting ATP synthase 6 chain precursor 




IS 


YBR039 


ATP3 


H* -transporting ATP synthase y chain precursor 




SJS 


YML054 


CYB2 


Lactate dehydrogenase cytochrome b2 




3.4 


YML054 


CYB2 


Lactate dehydrogenase cytochrome b2 




23 


YKL150 


MCR1 


Cytochrome-W reductase 




4.2 


YBL045 


COR1 


Ubiquinol-cytochrome c reductase 44K core protein 




3.5 


YDL067 


COX9 


Cytochrome c oxidase chain VI I A 




2.7 


YLR038 


COX12 


Cytochrome c oxidase, subunit VIB 




2.6 


YHR051 


COX6 


Cytochrome c oxidase subunit VI 




14 


YLR395 


COX8 


Cytochrome c oxidase chain VIII 




23 


YFR033 


OCR6 


Ubiquinol-cytochrome c reductase 17K protein 




23.7 


YLR081 


GAL2 


Galactose (and glucose) permease 




21.9 


YBR018 


GAL7 


UDP-glucose-hexose-1 -phosphate uridylyltransferase 




21.8 


YBR020 


GAL1 


Galactokinase 




19.5 


YBR019 


GAL10 


UDP-glucose 4-cpimerasc 




14.7 


YLR081 


GAL2 


Galactose (and glucose) permease 




8.6 


YDR009 


GAL3 


Galactokinase 




3.0 


YML051 


GAL80(1) 


Negative regulator for expression of gaiactose-induced genes 




2.8 


YML051 


GAL80(2) 


Negative regulator for expression of gaiactose-induced genes 


2.7 




YER055 


HIS1 


ATP phosphoribosyltransfcrasc 


3.4 




YBR248 


HIS7 


Glutamine amidotransferase/cydase 






YCL030 


HIS4 


Phosphoribosyl-AMP cyclohydrolasc/phosphoribosyl-ATP pyrophosphatasc/histidinol 


7.4 




dehydrogenase 


5.8 




YKR080 


MTD1 


Methyienetetrahydrofolate dehydrogenase (NAD+) 


6.0 




YDR019 


GCV1 


Glycine decarboxylase T subunit 


6.1 




YLR058 


SHM2 


Serine hydroxymethyltransferase 




8.1 


YML123 


PH084 


High-affinity inorganic phosphate/H* symporter 


3.5 




YDR408 


ADE8 


Phosphoribosylglycinamide formyltransferase (GART) 


3.6 




YDR408 


ADE8 


Phosphoribosylglycinamidc formyltransferase (GART) 


4.4 




YAR015 


ADE1 


Phosphoribosylamidoimidazolc-succinocarboxamidc synthase 


5.6 




YMR300 


ADE4 


Amidophosphoribosyltransfcrasc 


5.6 




YOR128 


ADE2 


Phosphoribosylaminoimidazole carboxylase 


6.0 




YGL234 


ADE5,7 


Phosphoribosylamine-glycine ligase and phosphoribosylforrnylglycinamidinc cydo-ligase 




6.3 


YBL015 


ACH1 


Acetyl-CoA hydrolase 



Heat Shock Results. A log phase culture growing in YEP/ 
dextrose medium at 25°C was split in half. One half of the 
culture remained at 25°C whereas the other half of the culture 
was shifted to 39°C. mRNA was isolated from both cultures 1 h 
after heat shock for comparison on microarrays and, although 
this time point is not optimal for measuring induction of heat 
shock mRNAs (17), many known heat shock genes exhibited 
considerable induction at this time point (Table 1; Fig. 2). 
Down-regulation of genes in the ribosomal protein and histone 
gene categories was also observed. Differential expression 
between the heat-shocked culture and the control was also 
observed for many other genes. Genes in many categories, such 
as amino acid catabolism and amino acid synthesis, exhibited 
a mixed response with some genes showing little or no 
differentia] expression and other genes showing a significant 
increase or decrease in gene expression in response to heat 
shock (Table 1; Fig. 2). 

Cold Shock Results. A log phase culture growing in YEP/ 
dextrose medium at 37°C was split in half. One half of the 
culture remained at 37°C while the other half of the culture was 
shifted to 18°C. mRNA was isolated from both cultures 1 h 
after cold shock for comparison on microarrays. As expected, 



two known cold shock genes (TIP1, TIR1) were expressed at 
a significantly higher level in the cold-shocked culture. Genes 
in other functional categories, such as glucose metabolism and 
heat shock displayed a mixed response with expression of some 
genes being unaffected and other genes exhibiting significant 
up- or down-regulation in response to cold shock (Table 2). 

Steady-State Galactose vs. Glucose Results. mRNA was 
isolated from steady-state log phase YEP galactose and YEP 
glucose grown cultures for comparison on the microarrays. As 
expected, the GAL genes were expressed at a much higher 
level in the galactose culture. Many genes were differentially 
expressed in these cultures that were not a priori expected to 
exhibit differential expression. For example, some genes in the 
amino acid catabolic category were up-regulated in the galac* 
tose culture whereas genes in the one-carbon metabolism and 
purine categories were largely or entirely down-regulated in 
the galactose culture (Table 3). Genes in other categories, such 
as amino acid synthesis, abc transporter, cytochrome c, and 
cytochrome b, exhibited mixed responses; some genes in a 
category showed little or no obvious differential expression 
whereas other genes in the same category showed significant 
differential expression in the galactose and glucose cultures. 
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DISCUSSION 
The results of these experiments show that many genes are 
differentially expressed under the three environmental condi- 
tions described here. The expected and predicted changes in gene 
expression, such as HSP12 in the heat-shocked culture, TTP1 in 
the cold-shocked culture, and GAL2 in the steady-state galactose 
culture, were observed in every case. However, in addition to the 
expected changes in gene expression, significant differential 
expression was also observed for many other genes that would 
not, a priori* be expected to be differentially expressed. For 
example, expression of PHOll decreased and expression of 
YLR194, KIN2, and HXT6 increased in the heat shocked culture. 
Expression of MST1 and APE3 decreased and expression of 
PDR5 and GAR1 increased in the cold-shocked culture. In 
addition, ADE4 and SER2 were expressed at reduced levels 
whereas PH084 and ACH1 were expressed at higher levels in 
cells grown in galactose compared with cells grown in glucose. 
Differential expression of these and many other genes was specific 
to one of these three environmental conditions. 

Many other genes were found to be differentially expressed 
under more than one condition. When differentially expressed 
genes in cold- and heat-shocked cultures were compared, 30 
genes were found in common. Of these 30 genes, 28 showed 
inverse expression (i.e., increased expression under one condition 
and decreased expression under the other condition). Two genes, 
YCRQ58 and YKL102, showed elevated expression in response to 
both cold and heat shock. Fifteen genes were found to be 
differentially expressed in both the heat-shocked and steady-state 
galactose cultures: 9 genes showed increased expression and 5 
showed decreased expression under both conditions. Twenty 
genes were differentially expressed in both the cold-shocked and 
steady-state galactose cultures: 8 genes showed decreased expres- 
sion and 5 genes showed increased expression under both con- 
ditions. Six genes showed increased expression in the galactose 
culture and decreased expression in the cold shocked culture. 
One gene (ODP1) showed increased expression in both the 
cold-shocked and steady-state galactose cultures. 

Gene expression is affected in a global fashion when environ- 
mental conditions are changed and both expected and unex- 
pected genes are affected. There is also overlap in the genes that 
are differentially expressed under quite different environmental 
conditions. These results can be rationalized by considering the 
high degree of cross-pathway regulation in yeast. For example, 
there is evidence for cross-pathway regulation between (i) carbon 
and nitrogen metabolism (18), (u) phosphate and sulfate metab- 
olism (19), and (iu) purine, phosphate, and amino acid metabo- 
lism (20-24). There are also examples of the interaction of 
general and specific transcription factors (25, 26). Finally, within 
the broad class of amino acid biosynthetic genes, there is evidence 
for amino acid specific regulation of some genes, regulation via 
general control for other genes, and regulation via both specific 
and general control for other genes (22, 27-30). 

Cross-pathway regulation arises from the complex structure 
of promoters. Virtually all promoters contain sites for multiple 
transcription factors and, therefore, virtually all genes are 
subject to combinatorial regulation. For example, the HIS4 
promoter contains binding sites for GCN4 (the general amino 
acid control transcription factor), PH02/BAS2 (a transcrip- 
tional regulator of phosphatase and purine biosynthetic 
genes), and BAS1 (a transcriptional regulator of purine bio- 
synthetic genes) (31). It is likely that the complex effects on 
gene expression described in this work are a direct conse- 
quence of the combinatorial regulation of gene expression. 

These findings illustrate the power of the highly parallel whole 
genome approach when examining gene expression. The global 
effects of environmental change on gene expression can now be 
directly visualized. It is clear that determining the mechanism(s) 
and the functional role of the dramatic global effects on gene 



expression in different environments will be a significant chal- 
lenge. The era of whole genome analysis win, ultimately, allow 
researchers to switch from the very focused single gene/promoter 
view of gene expression and instead view the cell more as a large 
complex network of gene regulatory pathways. 

With the entire sequence of this model organism known, new 
approaches have been developed that allow for genome wide 
analyses (32, 33) of gene function. The genome microarrays 
represent a novel tool for genetic and expression analysis of the 
yeast genome. This pilot study uses arrays containing >35% of 
the yeast ORFs and it is clear that the entire set of ORFs from 
the yeast genome can be arrayed using the directed primer based 
strategy detailed here. Recent advances in arraying technology 
will allow all 6,100 ORFs to be arrayed in an area of less than 1.8 
cm 2 . Furthermore, as the technology improves, detection limits 
will allow less than 500 ng of starting mRNA material to be used 
for making probe. 

The genome arrays provide for a robust, fully automated 
approach toward examining genome structure and gene func- 
tion. They allow for comparisons between different genomes 
as well as a detailed study of gene expression at the global level. 
This research will help to elucidate relationships between 
genes and allow the researcher to understand gene function by 
understanding expression patterns across the yeast genome. 

Support was provided by National Institutes of Health Grant 
P0/HG00205. 

1. Flcischminn, R. D„ Adams, M. D.. White. CL Clayton. R. A, Kirkocn, E. F-ef 
at. (1995) Science 269, 496-511 ^ N 

2 - _ C . M " S? 3 ^ In?" Whilc * °- Adanu - M * D - R- A., er «/. 

(1995) Science 270, 397-403. 

3. Bull. C J., White, O.. Ohen, C. J., Zhou. U Fleischnunn, R. Ductal 0996) 
Science 273, 1058-1073. ' 

4. Sulston, J., Du. r, Thomas, K., WiUon, R, Hillier. U a at. (1992) Nature 
(London) 356, 37. 

5. Newman. T. t dc Bruijn, F. J., Green. P., Keegstra. K_, Kendc. H, a at. (1 994) Plant 
Physiol. 106, 1241-1255. 1 ' 

6. Ushkari. D. A.. HunickcSmith, S. P.. Norgren. R. M„ Davis. R. W. & Brcnnan. 
T. (1995) Proc. Nad. Acad. ScL USA 92, 7912-7915. 

7. Schena. M., Shalon, D„ Davis, R. W. & Brown, P. O. (1995) Science 270, 467-470. 

8. Shalon, D., Smith. S. & Brown, P. O. (1996) Genome Rex. 6, 639-645 

9. Heller, R. A., Schena. M., Chai, A., Shalon, D., BedilioiwT., Gilmore, J 'wooUev 
D. E & Davis, R. W. (1997) Proc. Nad. Acad. ScL USA 94, 2150-2155 

10. DeRisi,J..Pcnland,Ubrown,P.O..Bittner,M.L-.Melt2er.P.S.,Ray,M Chen. 
Y_ Su Ya & Trent. J. M. (1996) Nat. Genet. 14, 457-460 

1 1. Nelson, S. F.. McCuskcr. J. H.. Sander, M. A. Kee, Y.. Modrich, P. A Brown P O 
(1993) Nat. Genet. 4, 1 1-18. 

11 Hoffman. C S. & Winston, F. (1989) Gene 84, 473-479. 

13. Schmitt, M.. Brown, T. & Trumpower. B. (1990) Nucieic Acids Res. 18. 3091 

14 ' ^287-294 MUn,y - A ' ^ Wur8ler * F E - * S^** 1 *** C 0 W«)*af. Gen. Gene*. 

15. Kim. K-W.. Kamcrud, J. O.. Livingston, D. M. A Roon, R. J. (1988)/ Biol Chan. 
263,11948-11953. 

16. Kim, K..W. & Roon. R. J. (1984) / Bacterid. 157, 958-961. 

17. Craig. E. A. (1992) in The Molecular Biology of the Yeast Saccharomycer Gene 
Expression, cds. Jones. E. W„ Pringlc. J. R. & Broach, J. R. (Cold Spring Harbor 
Lab. Press. Plainview, NY). Vol. 2, pp. 501-537. 

18. Dang. V. D.. Bonn, C, Bolotin-Fukuhara. M. & Daianan-Fomier B M996W 
Bacteriot. 1 78, 1 842-1 849. 1 ' 

19. O'Conncll. K. F. & Baker. R. E (1992) Genetics 132, 63-73 

2 °" 939 "w?" M0SCh ' H ' U "* V ° gCU **" H ' nncn ' A - * Hutler . R* 0989) EMBOJ. 8, 

21. Mosch, H. U.. Schcier, B.. Lahti. R.. Mantsala, P. &. Braus. C. H M991 \J Biol 
Chem. 266, 20453-20456. V ' 

22. Mitchell, A. P. & Magasanik. B. (1984) Mot. Celt. Biol. 4, 2767-2773 

23. Daignan-Fomicr. B. & Fink, G. R. (1992) Proc. Natl. Acad. Set USA 89 
6746-6750. * 

24. Ticc-Baldwin, K., Fink. G. R. 4 Arndi. K. T. (1989) Science 246, 931-935 

25. Mcsscnguy, F. & Dubois. E. (1993) Mot. CeU. Biol. 13, 2586-259Z 

26. Devlin. C. Ticc-Baldwin, K_, Shore. D. & Amdt, K. T. (1991) Mot. CeU Biol 11 
3642-3651. 

27. Magasan ik, B. (1992) in The Molecular and Cellular Biology of the Yeast Saccha- 
romyces: Gene Expression, cds. Jones, E W., Pringlc, J. R. & Broach, J. R (Cold 
Spring Harbor Lab. Press. Plainview, NY). Vol. 2, pp. 283-317 

28. Hinncbusch, A. G, (1992) in The Molecular and Cellular Biology of the Yeast 
Saccharomyces: Gene Expression, eds. Jones, E. W., Pringlc, J. R. & Broach J R. 
(Cold Spring Harbor Lab. Press, Plainview, NY). Vol. 2, pp 319-414 

29. Brisco, P. R. it Kohlhaw. C. B. (1990)/ Biol. Chem. 265, 11667-11675 

30. O'Conncll, K. F. ( Surdin-Kcrjan, Y. & Baker R. E (1995) Mot. Cell. Biol. 15, 
1879—1888. 

31. Arndt K. T., Styles, C & Fink, G. R. (1987) Science 237. 874-880. 

32 - j^' 1 !^' N - Lashkari, D.. Boutein, D. & Brown. P. O. (1996) Science 

33. Shoemaker, D. D., Lashkari. D. A., Morris, D.. Mittman, M. & Davis. R. W 
(1996) Ate/. Genet. 14, ; 450-456. ' 



Exhibit G of Rockett Declaration 

with Response dated 1 1/1 8/03 j 

In USSN: 09/867,958 | 

PC-0041 CIP 1 



Rscher-Vee. Science 270. 1626 (1995). 

35. T. C. James and S. C. Elgin, Mot. CeiBioi. 6. 3862 
(1986); R. Paro and D.S. Hogness^PmcNatLAcad. 
Set USA 88, 263 (1991); B. Tschiersch ef aL, 
EMBO J. 13. 3822(1994); M. T. MadrecfcSef at. Cat 
87. 75 (199Q; 0. G. Stokes. K. D. Tertof. R P. Peny, 
Prx. Natl. Acad. ScL USA 83. 7137 (1996). 

36. P. M. Pabsaari ef at. J. Bk)i. Chem. 266. 10750 
(1991); A. Schmftz, K. H. Gartemann, J. Fiedler. E. 



Gruxl, R. Bchenlaub. Appt. Environ. Microbiol 58. 
4066 (1992); V. Sharma, K. Suvama, R. Mega- 
nathan, M. E. Hudspeth. J. Bacterioi 174. 5057 

(1992) ; M. Kanazawa ef a/.. Enzyme Protein 47, 9 

(1993) ; Z. L Boynton, G. N. Bennet, F, 6. Rudofch, 
J. Bacterioi. 178. 3015(1996). 

37. M.Hoefai.Ceff77. 869 (1994). 

36. W. Hendrfes et a!. t J. Cell Bkxhem. 59. 418 (1995). 

39. We thank H. Skatetsky and F. Lewitter for help wth 



28 April 1997; accepted 9 September 1997 



Exploring the Metabolic and Genetic Control of 
Gene Expression on a Genomic Scale 

Joseph L DeRisi, Vishwanath R. Iyer, Patrick O. Brown* 

DNA microarrays containing virtually every gene of Saccharomyces cerevisiae were used 
to carry out a comprehensive investigation of the temporal program of gene expression 
accompanying the metabolic shift from fermentation to respiration. The expression 
profiles observed for genes with known metabolic functions pointed to features of the 
metabolic reprogramming that occur during the diauxic shift, and the expression patterns 
of many previously uncharacterized genes provided clues to their possible functions. The 
same DNA microarrays were also used to identify genes whose expression was affected 
by deletion of the transcriptional co-repressor TUP1 or overexpression of the transcrip- 
tional activator YAP1. These results demonstrate the feasibility and utility of this ap- 
proach to genomewide exploration of gene expression patterns. 



The complete sequences of nearly a dozen 
microbial genomes are known, and in the 
next several years we expect to know the 
complete genome sequences of several 
metazoans, including the human genome. 
Defining the role of each gene in these 
genomes will be a formidable task, and un- 
derstanding how the genome functions as a 
whole in the complex natural history of a 
living organism presents an even greater 
challenge. 

Knowing when and where a gene is 
expressed often provides a strong clue as to 
its biological role. Conversely, the pattern 
of genes expressed in a cell can provide 
detailed information about its state. Al- 
though regulation of protein abundance in 
a cell is by no means accomplished solely 
by regulation of mRNA, virtually all dif- 
ferences in cell type or state are correlated 
with changes in the mRNA levels of many 
genes. This is fortuitous because the only 
specific reagent required to measure the 
abundance of the mRNA for a specific 
gene is a cDNA sequence. DNA microar- 
rays, consisting of thousands of individual 
gene sequences printed in a high-density 
array on a glass microscope slide (J, 2), 
provide a practical and economical tool 
for studying gene expression on a very 
large scale (3-6). 

Saccharomyces cerevisiae is an especially 
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favorable organism in which to conduct a 
systematic investigation of gene expression. 
The genes are easy to recognize in the ge- 
nome sequence, cis regulatory elements are 
generally compact and close to the tran- 
scription units, much is already known 
about its genetic regulatory mechanisms, 
and a powerful set of tools is available for its 
analysis. 

A recurring cycle in the natural history 
of yeast involves a shift from anaerobic 
(fermentation) to aerobic (respiration) me- 
tabolism. Inoculation of yeast into a medi- 
um rich in sugar is followed by rapid growth 
fueled by fermentation, with the production 
of ethanol. When the fermentable sugar is 
exhausted, the yeast cells turn to ethanol as 
a carbon source for aerobic growth. This 
switch from anaerobic growth to aerobic 
respiration upon depletion of glucose, re- 
ferred to as the diauxic shift, is correlated 
with widespread changes in the expression 
of genes involved in fundamental cellular 
processes such as carbon metabolism, pro- 
tein synthesis, and carbohydrate storage 
(7). We used DNA microarrays to charac- 
terize the changes in gene expression that 
take place during this process for nearly the 
entire genome, and to investigate the ge- 
netic circuitry that regulates and executes 
this program. 

Yeast open reading frames (ORFs) were 
amplified by the polymerase chain reaction 
(PCR), with a commercially available set of 
primer pairs (8). DNA microarrays, con- 
taining approximately 6400 distinct DNA 
sequences, were printed onto glass slides by 



using a simple robotic printing device (9). 
Celb from an exponentially growing culture 
of yeast were inoculated into fresh medium 
and grown at 30°C for 21 hours. After an 
initial 9 hours of growth, samples were har- 
vested at seven successive 2-hour intervals, 
and mRNA was isolated (JO). Fluorescently 
labeled cDN A was prepared by reverse tran- 
scription in the presence of Cy3(green)- 
or Cy5(red)-labeled deoxyuridine triphos- 
phate (dUTP) (II) and then hybridized to 
the microarrays (12). To maximize the re- 
liability with which changes in expression 
levels could be discerned, we labeled cDNA 
prepared from cells at each successive time 
point with Cy5, then mixed it with a Cy3- 
labeled "reference" cDNA sample prepared 
from celb harvested at the first interval 
after inoculation. In this experimental de- 
sign, the relative fluorescence intensity 
measured for the Cy3 and Cy5 fluors at 
each array element provides a reliable mea- 
sure of the relative abundance of the corre- 
sponding mRNA in the two cell popula- 
tions (Fig. 1). Data from the series of seven 
samples (Fig. 2), consisting of more than 
43,000 expression-ratio measurements, 
were organized into a database to facilitate 
efficient exploration and analysis of the 
results. This database is publicly available 
on the Internet (13). 

During exponential growth in glucose- 
rich medium, the global pattern of gene 
expression was remarkably stable. Indeed, 
when gene expression patterns between die 
first two cell samples (harvested at a 2-hour 
interval) were compared, mRNA levels dif- 
fered by a factor of 2 or more for only 19 
genes (0.3%), and the largest of these dif- 
ferences was only 2.7-fold ( 14). However, as 
glucose was progressively depleted from the 
growth media during the course of the ex- 
periment, a marked change was seen in the 
global pattern of gene expression. mRNA 
levels for approximately 710 genes were 
induced by a factor of at least 2, and the 
mRNA levels for approximately 1030 genes 
declined by a factor of at least 2. Messenger 
RNA leveb for 183 genes increased by a 
factor of at least 4, and mRNA leveb for 
203 genes diminished by a factor of at least 
4. About half of these differentially ex- 
pressed genes have no currently recognized 
function and are not yet named. Indeed, 
more than 400 of the differentially ex- 
pressed genes have no apparent homology 
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to any gene whose function is known (15). 
The responses of these previously un char- 
acterized genes to the diauxic shift therefore 
provides the first small clue to their possible 
roles. 

The global view of changes in expres- 
sion of genes with known functions pro- 
vides a vivid picture of the way in which 
the cell adapts to a changing environ- 
ment. Figure 3 shows a portion of the yeast 
metabolic pathways involved in carbon 
and energy metabolism. Mapping the 
changes we observed in the mRNAs en- 
coding each eruyme onto this framework 
allowed us to infer the redirection in the 
flow of metabolites through this system. 
We observed large inductions of the genes 
coding for the enzymes aldehyde dehydro- 
genase (ALD2) and acetyl-coenzyme 
A(CoA) synthase (ACSJ), which func- 
tion together to convert the products of 
alcohol dehydrogenase into acetyl-CoA, 
which in turn is used to fuel the tricarbox- 
ylic acid (TCA) cycle and the glyoxylate 
cycle. Trie concomitant shutdown of tran- 
scription of the genes encoding pyruvate 
decarboxylase and induction of pyruvate 
carboxylase rechannels pyruvate away 
from acetaldehyde, and instead to oxalac- 
etate, where it can serve to supply the 
TCA cycle and gluconeogenesis. Induc- 
tion of the pivotal genes PCK1 , encoding 
phosphoenolpyruvate carboxykinase, and 
FBPJ, encoding fructose 1,6-biphos- 
phatase, switches the directions of two key 
irreversible steps in glycolysis, reversing 
the flow of metabolites along the revers- 
ible steps of the glycolytic pathway toward 
the essential biosynthetic precursor, glu- 
coses-phosphate. Induction of the genes 
coding for the trehalose synthase and gly- 
cogen synthase complexes promotes chan- 
neling of glucose-6-phosphate into these 
carbohydrate storage pathways. 

Just as the changes in expression of 
genes encoding pivotal enzymes can pro- 
vide insight into metabolic reprogram- 
ming, the behavior of large groups of func- 
tionally related genes can provide a broad 
view of the systematic way in which the 
yeast cell adapts to a changing environ- 
ment (Fig. 4). Several classes of genes, 
such as cytochrome c-related genes and 
those involved in the TCA/glyoxylate cy- 
cle and carbohydrate storage, were coord i- 
nately induced by glucose exhaustion. In 
contrast, genes devoted to protein synthe- 
sis, including ribosomal proteins, tRNA 
synthetases, and translation, elongation, 
and initiation factors, exhibited a coordi- 
nated decrease in expression. More than 
95% of ribosomal genes showed at least 
twofold decreases in expression during the 
diauxic shift (Fig. 4) (J3). A noteworthy 
and illuminating exception was that the 



genes encoding mitochondrial ribosomal 
genes were generally induced rather than 
repressed after glucose limitation, high- 
lighting the requirement for mitchondrial 
biogenesis (13). As more is learned about 
the functions of every gene in the yeast 
genome, the ability to gain insight into a 
cell's response to a changing environment 
through its global gene expression patterns 
will become increasingly powerful. 

Several distinct temporal patterns of ex- 
pression could be recognized, and sets of 
genes could be grouped on the basis of the 
similarities in their expression patterns. The 
characterized members of each of these 
groups also shared important similarities in 
their functions. Moreover, in most cases, 
common regulatory mechanisms could be 
inferred for sets of genes with similar expres- 
sion profiles. For example, seven genes 
showed a late induction profile, with mRNA 
levels increasing by more than ninefold at 
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the last timepoint but less than threefold at 
the preceding timepoint (Fig. 5B). AH of 
these genes were known to be glucose-re- 
pressed, and five of the seven were previously 
noted to share a common upstream activat- 
ing sequence (UAS), the carbon source re- 
sponse element (CSRE) (16-20). A search 
in the promoter regions of the remaining two 
genes, ACR1 and IDP2, revealed that 
ACR1, a gene essential for ACSJ activity, 
also possessed a consensus CSRE motif, but 
interestingly, 1DP2 did not. A search of the 
entire yeast genome sequence for the con- 
sensus CSRE motif revealed only four addi- 
tional candidate genes, none of which 
showed a similar induction. 

Examples from additional groups of 
genes that shared expression profiles are 
illustrated in Fig. 5, C through F. The 
sequences upstream of the named genes in 
Fig. 5C all contain stress response ele- 
ments (STRE), and with the excepti n 




Fig. 1. Yeast genome microarray. The actual size of the microarray is 18 mm by 18 mm The 
microarray was printed as described (9). This image was obtained with the same fluorescent 
scannmg confocal m.croscope used to collect all the data we report (49). A fluorescentry labeled 
P r0De was Prepared from mRNA isolated from ceils harvested shortly after inoculation (culture 
density of <5 x 10 6 cells/ml and media glucose level of 19 g/lrter) by reverse transcription in the 
presence of Cy3-dUTP. Similarly, a second probe was prepared from mRNA isolated from cells taken 
from the same culture 9.5 hours later (culture density of -2 x 10 8 cells/ml, with a glucose level f 
Z * 9 ^ f) by reVGrSe transc "P tion in tne Presence of Cy5-dUTP. In this image, hybridization of the 
Cy3-dUTP-labeled cDNA (that is. mRNA expression at the initial timepoint) is represented as a green 
signal, and hybridization of Cy5-d(JTP-tebeled cDNA (that is. mRNA expression at 9.5 hours) is 
represented as a red signal. Thus, genes induced or repressed after the diauxic shift appear in this 
image as red and green spots, respectively. Genes expressed at roughly equal levels before and after 
the diauxic shift appear in this image as yellow spots. 
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of HSP42, have previously been shown to 
be controlled at least in part by these 
elements {21-24). Inspection of the se- 
quences upstteam of HSP42 and the two 
uncharacterized genes shown in Fig. 5C, 
YKL026c, a hypothetical protein with 
similarity to glutathione peroxidase, and 
YGR043c, a putative rxansaldolase, re- 
vealed that each of these genes also pos- 
sess repeated upstream copies of the stress- 
responsive CCCCY motif. Of the 13 ad- 
ditional genes in the yeast genome that 
shared this expression profile [including 
HSP30, ALD2, OM45, and 10 uncharac- 
terized ORFs (25)], nine contained one or 
more recognizable STRE sites in their up- 
stream regions. 

The heterotrimeric transcriptional acti- 
vator complex HAP2 t 3A has been shown 
to be responsible for induction of several 
genes important for respiration (26-28). 
This complex binds a degenerate consensus 
sequence known as the CCAAT box (26). 
Computer analysis, using the consensus se- 
quence TNRYTGGB (29), has suggested 
that a large number of genes involved in 
respiration may be specific targets of 
HAP2,3 t 4 (30). Indeed, a putative 
HAP2,3,4 binding site could be found in 
the sequences upstream of each of the seven 
cytochrome c-related genes that showed 
the greatest magnitude of induction (Fig. 
5D). Of 12 additional cytochrome c-related 
genes that were induced, HAP2,3,4 binding 
sites were present in all but one. Signifi- 
cantly, we found that transcription of 
HAP4 itself was induced nearly ninefold 
concomitant with the diauxic shift. 

Control of ribosomal protein biogenesis 
is mainly exerted at the transcriptional 
level, through the presence of a common 
upstream-activating element (LIAS,—) 
that is recognized by the Rapl DNA-bind- 
ing protein (31 , 32). The expression pro- 
files of seven ribosomal proteins are shown 
in Fig. 5F. A search of the sequences 
upstream of all seven genes revealed con- 
sensus Rapl -binding motifs (33). It has 
been suggested that declining Rapl levels 
in the cell during starvation may be re- 
sponsible for the decline in ribosomal pro- 
tein gene expression (34). Indeed, we ob- 
served that the abundance of RAP J 
mRNA diminished by 4.4-fold, at about 
the time of glucose exhaustion. 

Of the 149 genes that encode known or 
putative transcription factors, only two, 
HAP4 and S1P4, were induced by a factor of 
more than threefold at the diauxic shift. 
SIP4 encodes a DNA-binding transcrip- 
tional activator that has been shown to 
interact with Snfl, the "master regulator" of 
glucose repression (35). The eightfold in- 
duction of S1P4 upon depletion of glucose 
strongly suggests a role in the induction of 



downstream genes at the diauxic shift. 

Although most of the transcriptional 
responses that we observed were not pre- 
viously known, the responses of many 
genes during the diauxic shift have been 
described. Comparison of the results we 
obtained by DNA microarray hybridiza- 
tion with previously reported results there- 
fore provided a strong test of the sensitiv- 
ity and accuracy of this approach. The 
expression patterns we observed for previ- 
ously characterized genes showed almost 
perfect concordance with previously pub- 
lished results (36). Moreover, the differ- 
ential expression measurements obtained 
by DNA microarray hybridization were re- 
producible in duplicate experiments. For 
example, the remarkable changes in gene 
expression between cells harvested imme- 
diately after inoculation and immediately 
after the diauxic shift (the first and sixth 
intervals in this time series) were mea- 
sured in duplicate, independent DNA mi- 
croarray hybridizations. The correlation 
coefficient for two complete sets of expres- 
sion ratio measurements was 0.87, and for 
more than 95% of the genes, the expres- 



sion ratios measured in these duplicate 
experiments differed by less than a factor 
of 2. However, in a few cases, there were 
discrepancies between our results and pre- 
vious results, pointing to technical limita- 
tions that will need to be addressed as 
DNA microarray technology advances 
(37, 38). Despite the noted exceptions, 
the high concordance between the results 
we obtained in these experiments and 
those of previous studies provides confi- 
dence in the reliability and thoroughness 
of the survey. 

The changes in gene expression during 
this diauxic shift are complex and involve 
integration of many kinds of information 
about the nutritional and metabolic state 
of the cell. The large number of genes 
whose expression is altered and the diver- 
sity of temporal expression profiles ob- 
served in this experiment highlight the 
challenge of understanding the underlying 
regulatory mechanisms. One approach to 
defining the contributions of individual 
regulatory genes to a complex program of 
this kind is to use DNA microarrays t 
identify genes whose expression is affected 



Fig. 2. The section of the ar- 
ray indicated by the gray box 
in Fig. 1 is shown for each of 
the experiments described 
here. Representative genes 
are labeled. In each of the ar- 
rays used to anafyze gene 
expression during the diauxic 
shift, red spots represent 
genes that were induced rel- 
ative to the initial timepoint, 
and green spots represent 
genes that were repressed 
relative to the initial timepoint. 
In the arrays used to analyze 
the effects of the tuplb mu- 
tation and YAP1 overexp res- 
sbn, red spots represent 
genes whose expression was 
increased, and green spots 
represent genes whose ex- 
pression was decreased by 
the genetic modification. Note 
that distinct sets of genes are 
induced and repressed in the 
different experiments. The 
complete images of each of 
these arrays can be viewed on 
the Internet (73). Cell density 
as measured by optical densi- 
ty (OD) at 600 nm was used to 
measure the growth of the 
culture. 
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by mutations in each putative regulatory 
gene. As a test of this strategy, we analyzed 
the genomcwidc changes in gene expression 
that result from deletion of the TUPl gene. 
Transcriptional repression of many genes by 
glucose requires the DNA-binding repressor 



Migl and is mediated by recruiting the tran- 
scriptional co-repressors Tupl and Cyc8/ 
Ssn6 (39). Tupl has also been implicated in 
repression of oxygen-regulated, mating-type- 
specific, and DNA-damage-inducible genes 
(40). 
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Fig. 3. Metabofic reprogramming inferred from global analysis of changes in gene expression. Only key 
metabolic intermediates are identified. The yeast genes encoding the enzymes that catalyze each step 
in this metabolic circuit are identified by name in the boxes. The genes encoding succinyl-CoA synthase 
and glycogen-debrancNng enzyme have not been explicitly identified, but the ORFs YGR244 and 
YPR184 show significant homology to known succinyl-CoA synthase and glycogen-debranching en- 
zymes, respectively, and are therefore included in the corresponding steps in this figure. Red boxes with 
white lettering identify genes whose expression increases in the diauxic shift. Green boxes with dark 
green lettering identify genes whose expression diminishes in the diauxic shift. The magnitude of 
induction or repression is indicated for these genes. For multimeric enzyme complexes, such as 
succinate dehydrogenase, the indicated fold-induction represents an unweighted average of aJI the 
genes listed in the box. Black and white boxes indicate no significant differential expression (less than 
twofold). The direction of the arrows connecting reversible enzymatic steps indicate the direction of the 
flow of metabolic intermediates, inferred from the gene expression pattern, after the diauxic shift. Arrows 
representing steps catalyzed by genes whose expression was strongly induced are highlighted in red. 
The broad gray arrows represent major increases in the flow of metabolites after the diauxic shift, 
inferred from the indicated changes in gene expression. 
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Wild-type yeast cells and cells bearing 
a deletion of the TUPl gene (tupl A) were 
grown in parallel cultures in rich medium 
containing glucose as the carbon source. 
Messenger RNA was isolated from expo- 
nentially growing cells from the two pop- 
ulations and used to prepare cDNA la- 
beled with Cy3 (green) and Cy5 (red), 
respectively (11), The labeled probes were 
mixed and simultaneously hybridized to 
the microarray. Red spots on the microar- 
ray therefore represented genes whose 
transcription was induced in the tupl A 
strain, and thus presumably repressed by 
Tupl (41). A representative section of the 
microarray (Fig. 2, bottom middle panel) 
illustrates that the genes whose expression 
was affected by the tup J A mutation, were, 
in general, distinct from those induced 
upon glucose exhaustion [complete images 
of all the arrays shown in Fig. 2 axe avail- 
able on the Internet (J3)J. Nevertheless, 
34 ( 10%) of the genes that were induced 
by a factor of at least 2 after the diauxic 
shift were similarly induced by deletion f 
TUP J, suggesting that these genes may be 
subject to TUP 1 -mediated repression by 
glucose. For example, SUC2 t the gene en- 
coding invertase, and all five hexose trans- 
porter genes that were induced during the 
course of the diauxic shift were similarly 
induced, in duplicate experiments, by the 
deletion of TUPL 

The set of genes affected by Tupl in this 
experiment also included a-glucosidases, 
the mating-type-specific genes MFAi and 
MFA2, and the DNA damage-induciblc 
RNR2 and RNR4, as well as genes involved 
in flocculation and many genes of unknown 
function. The hybridization signal corre- 
sponding to expression of TUPl itself was 
also severely reduced because of the (in- 
complete) deletion of the transcription unit 
in the tup] A strain, providing a positive 
control in the experiment (42). 

Many of the transcriptional targets of 
Tupl fell into sets of genes with related 
biochemical functions. For instance, al- 
though only about 3% of all yeast genes 
appeared to be TUP J -repressed by a factor 
of more than 2 in duplicate experiments 
under these conditions, 6 of the 13 genes 
that have been implicated in flocculation 
(15) showed a reproducible increase in 
expression of at least twofold when TUPl 
was deleted. Another group of related 
genes that appeared to be subject to TUPl 
repression encodes the serine-rtch cell 
wall mannoproteins, such as Tipl and 
Tirl/Srpl which are induced by cold 
shock and other stresses (43), and similar, 
serine-poor proteins, the seripauperins 
(44). Messenger RNA levels for 23 f the 
26 genes in this group were reproducibly 
elevated by at least 2.5-fold in the tup J A 
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strain, and 18 of these genes were induced 
by more than sevenfold when TUP1 was 
deleted. In contrast, none of 83 genes that 
could be classified as putative regulators of 
the cell division cycle were induced more 
than twofold by deletion of TUP1 . Thus, 
despite the diversity of the regulatory sys- 
tems that employ Tupl, most of the genes 
that it regulates under these conditions 
fall into a limited number of distinct func- 
tional classes. 

Because the microanray allows us to 
monitor expression of nearly every gene in 
yeast, we can, in principle, use this ap- 
proach to identify all the transcriptional 
targets of a regulatory protein like Tupl. It 
is important to note, however, that in any 
single experiment of this kind we can only 
recognize those target genes that are nor- 
mally repressed (or induced) under the 
conditions of the experiment. For in- 
stance, the experiment described here an- 
alyzed a MAT a strain in which MFAJ 
and MFA2, the genes encoding the a- 
factor mating pheromone precursor, are 
normally repressed. In the isogenic tup] A 
strain, these genes were inappropriately 
expressed, reflecting the role that Tupl 
plays in their repression. Had we instead 
carried out this experiment with a MA TA 
strain (in which expression of MFAJ and 
MFA2 is not repressed), it would not have 
been possible to conclude anything re- 
garding the role of Tupl in the repression 
of these genes. Conversely, we cannot dis- 
tinguish indirect effects of the chronic 
absence of Tupl in the mutant strain from 
effects directly attributable to its partici- 
pation in repressing the transcription of a 
gene. 

Another simple route to modulating the 
activity of a regulatory factor is to overex- 
press the gene that encodes it. YAP J en- 
codes a DNA-binding transcription factor 
belonging to the b-zip class of DNA-bind- 
ing proteins. Overexpression of YAP I in 
yeast confers increased resistance to hydro- 
gen peroxide, o-phenanthroline, heavy 
metals, and osmotic stress {45). We ana- 
lyzed differential gene expression between a 
wild-type strain bearing a control plasmid 
and a strain with a plasmid expressing YAPJ 
under the control of the strong GAL/ -JO 
promoter, both grown in galactose (that is, 
a condition that induces YAPJ overexpres- 
sion). Complementary DNA from the con- 
trol and YAPJ overexpressing strains, la- 
beled with Cy3 and Cy5, respectively, was 
prepared from mRNA isolated from the two 
strains and hybridized to the microarray. 
Thus, red spots on the array represent genes 
that were induced in the strain overexpress- 
ing YAPL 

Of the 17 genes whose mRNA levels 
increased by more than threefold when 



YAPJ was overexpressed in this way, five 
bear homology to aryl-alcohol oxidoreduc- 
tases (Fig. 2 and Table 1). An additional 
four of the genes in this set also belong to 
the general class of dehydrogenases/oxi- 
doreductases. Very little is known about 
the role of aryl-alcohol oxidoreductases in 
S. cerevisiae, but these enzymes have been 
isolated from ligninoiytic fungi, in which 
they participate in coupled redox reac- 
tions, oxidizing aromatic, and aliphatic 
unsaturated alcohols to aldehydes with the 
production of hydrogen peroxide (46, 47). 
The fact that a remarkable fraction of the 
targets identified in this experiment be- 
long to the same small, functional group of 
oxidoreductases suggests that these genes 

Fig. 4. Coordinated reg- 
ulation of functionally re- 
lated genes. The curves 
represent the average in- 
duction or repression ra- 
tios for all the genes in 
each indicated group. 
The total number of 
genes in each group was 
as follows: ribosomal 
proteins, 1 12; translation 
elongation and initiation 

factors. 25; tRNA synthetases (excluding mitochondiaJ synthetases). 17; glycogen and trehalose syn- 
thesis and degradation, 15; cytochrome c oxidase and reductase proteins, 19; and TCA- and glyoxy- 
late-cycle enzymes, 24. 

Table 1 . Genes induced by YAPl overexpression. This list includes all the genes for which mRNA levels 
increased by more than twofold upon YAPl overexpression in both of two duplicate experiments, and 
for which the average increase in mRNA level in the two experiments was greater than threefold (50). 
Positions of the canonical Yapl binding sites upstream of the start codon. when present, and the 
average fold-increase in mRNA levels measured in the two experiments are indicated. 



might play an important protective role 
during oxidative stress. Transcription of a 
small number of genes was reduced in the 
strain overexpressing Yapl. Interestingly, 
many of these genes encode sugar per- 
meases or enzymes involved in inositol 
metabolism. 

We searched for Yapl -binding sites 
(TTACTAA or TCACTAA) in the se- 
quences upstream of the target genes we 
identified (48). About two-thirds of the 
genes that were induced by more than 
threefold upon Yapl overexpression had 
one or more binding sites within 600 bases 
upstream of the start codon (Table 1 ), sug- 
gesting that they are directly regulated by 
Yapl. The absence of canonical Yapl-bind- 
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ing sites upstream of the others may reflect 
an ability f Yapl to bind sites that differ 
from the canonical binding sites, perhaps in 
cooperation with other factors, or less like- 
ly, may represent an indirect effect of Yapl 
overexpression, mediated by one or more 
intermediary factors. Yapl sites were found 
only four times in the corresponding region 
of an arbitrary set of 30 genes that were not 
differentially regulated by Yapl. 

Use of a DNA microarray to character- 
ize the transcriptional consequences of 
mutations affecting the activity of regula- 
tory molecules provides a simple and pow- 
erful approach to dissection and character- 
ization of regulatory pathways and net- 



works. This strategy also has an important 
practical application in drug screening. 
Mutations in specific genes encoding can- 
didate drug targets can serve as surrogates 
for the ideal chemical inhibitor or modu- 
lator of their activity. DNA microarrays 
can be used to define the resulting signa- 
ture pattern of alterations in gene expres- 
sion, and then subsequently used in an 
assay to screen for compounds that repro- 
duce the desired signature pattern. 

DNA microarrays provide a simple and 
economical way to explore gene expres- 
sion patterns on a genomic scale. The 
hurdles to extending this approach to any 
other organism are minor. The equipment 
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Fig. 5. Distinct temporal patterns of induction or repression help to group genes that share regulatory 
properties. (A) Temporal profite of the cell density, as measured by OD at 600 nm and glucose 
concentration in the media. (B) Seven genes exhibited a strong induction (greater than ninefold) only at 
the last timepoint (20.5 hours). With the exception of IDP2, each of these genes has a CSRE UAS. There 
were no additional genes observed to match this profite. (C) Seven members of a class of genes marked 
by earty induction with a peak in mRNA levels at 18.5 hours. Each of these genes contain STRE motif 
repeats in their upstream promoter regions, (D) Cytochrome c oxidase and ubiquinol cytochrome c 
reductase genes. Marked by an induction coincident with the diauxic shift, each of these genes contains 
a consensus binding motif for the HAP2.3.4 protein complex. At least 17 genes shared a similar 
expression profile. (E) SAM1, GPP1, and several genes of unknown function are repressed before the 
diauxic shift, and continue to be repressed upon entry into stationary phase. (F) Ribosomal protein 
genes comprise a large class of genes that are repressed upon depletion of glucose. Each of the genes 
profiled here contains one or more RAP1 -binding motifs upstream of its promoter. RAP1 is a transcrip- 
tional regulator of most ribosomal proteins. 



Reports 



required f r fabricating and using DNA 
microarrays (9) c nsists of components 
that were chosen for their modest cost and 
simplicity. It was feasible for a small group 
to accomplish the amplification of m re 
than 6000 genes in about 4 months and, 
once the amplified gene sequences were in 
hand, only 2 days were required to print a 
set of 1 10 microarrays of 6400 elements 
each. Probe preparation, hybridizati n, 
and fluorescent imaging are also simple 
procedures. Even conceptually simple ex- 
periments, as we described here, can yield 
vast amounts of information. The value f 
the information from each experiment of 
this kind will progressively increase as 
more is learned about the functions of 
each gene and as additional experiments 
define the global changes in gene expres- 
sion in diverse other natural processes and 
genetic perturbations. Perhaps the greatest 
challenge now is to develop efficient 
methods for organizing, distributing, inter- 
preting, and extracting insights from the 
large volumes of data these experiments 
will provide. 
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Human Genome Placed on Chip; Biotech Rivals Put It Up 
for Sale 



By ANDREW POLLACK (NYT) 1030 words 
The genome on a chip has arrived. 

Melding high technology with biology, several companies are rushing to sell slivers of glass or 
nylon, some as small as postage stamps, packed with pieces of all 30,000 or so known human 
genes. 

The new products will allow scientists to scan all genes in a human tissue sample at once, to 
determine which genes are active, a job that previously required two or more chips. The whole- 
genome chips will lower the cost and increase the speed of a widely used test that has 
transformed biomedical research in the last few years. 

"It's sort of a milestone event, very similar to generating an integrated circuit of the genome," 
said Stephen P. A. Fodor, the chief executive of Affymetrix Inc., the leading seller of gene chips, 
which are also called microarrays. 

Affymetrix, based in Santa Clara, Calif, is expected to announce today that it is accepting orders 
for its whole-genome chip. 

The announcement seems timed to steal some thunder from the rival Agilent Technologies, 
which is based in nearby Palo Alto. Agilent is to be the host of an analyst meeting today and it 
plans to announce then that it has started shipping test versions of its whole-genome chip. 

Applied Biosystems of Foster City, Calif., a unit of the Applera Corporation, started the race in 
July with an announcement that it would have a whole-genome chip out by the end of this year. 
NimbleGen Systems, a small company in Madison, Wis., announced a few days later that it had a 
genome on a chip that it was not selling but that it was using to run tests for customers. 

Gene chips, which detect genes that are active, meaning they are being used to make a protein, 
have become essential tools. Scientists try to understand the genetic mechanisms of disease by 
seeing which genes are turned on in, say, a sick kidney or lung compared with those active in a 
healthy organ. Pharmaceutical companies look at gene activity patterns to try to predict the 
effects of drugs. 



Scientists have found that tumors that look the same under the microscope can differ in terms of 
which genes are active. So studying gene patterns could become a way to discriminate between 
deadly and not-so-deadly tumors, or to predict which drug will work best for a particular patient. 

Still, even some vendors conceded that the change from two chips to one is more symbolic than 
revolutionary. 

"You can do just as good science with two chips, it costs you a little more," said Roland Green, 
the vice president for research and development at NimbleGen. 

Some scientists questioned whether the chips really have all human genes, because the exact 
number and identities of all the genes is not known. 

The advent of the genome on a chip is, however, evidence that biotechnology, to the extent that it 
uses electronics, is experiencing some of the rapid progress that has made semiconductors and 
computers continuously cheaper and smaller. 

"One of the effects everyone is looking for in the genomics area is Moore's law - more data, less 
money," said Doug Dolginow, an executive vice president at Gene Logic, which sells data from 
gene chip studies to pharmaceutical companies. "This is a step in that direction." 

Moore's law states that the number of transistors on a semiconductor chip doubles every 1 8 
months. 

Asymetrix's gene chips are, in fact, made with the same techniques used to make semiconductor 
chips. In the mid-1 990's, the company came out with a set of five chips covering what was then 
known of the human genome. After the human genome sequence was virtually completed in 
2000, the company developed a two-chip set with all the known genes. Now it has the single 
chip, which some scientists say will be more convenient. 

"We like to be able to look at all genes at one time to get a global view of what's going on," said 
John R. Walker, who runs gene chip operations at the Genomics Institute of the Novartis 
Research Foundation in San Diego. 

Costs should also be lower. Gene chips have been so expensive that many academic scientists 
still make their own rather than buy them. Affymetrix said it would sell its whole-genome chips 
for $300 to $500 each, depending on volume, little more than half the price of the two-chip set. 
The other companies have not announced prices. 

For Affymetrix, a successful whole-genome chip "is essential for them to maintain their 
dominance" of high-end microarrays, said Edward A. Tenthoff, an analyst at U.S. Bancorp Piper 
Jaffray. Affymetrix had total product sales in 2002 of about $250 million, and a company 
spokesman said that human genome chips are its top-selling product. 

Mr. Tenthoff, who recommends Affymetrix stock, said the company's sales growth rate had 
moderated as it faces tougher competition. Agilent, a spinoff of Hewlett-Packard that makes its 
gene chips by printing DNA components onto glass slides using ink jet printers, has gained 
share, he said. Applied Biosystems, the largest maker of genomics equipment over all, will be 



2 



entering the microarray segment of the business with its whole-genome chip, emphasizing the 
connection of that product to the others it offers, including the gene database developed by its 
sister company, Celera Genomics. 

Jeffrey Trent, scientific director of the Translational Genomics Research Institute in Phoenix, 
said that while whole-genome chips are useful for medical discovery, the biggest growth of tie 
market will be for chips that can be used by doctors to do diagnoses. And whole-genome chips 
are too cumbersome for that, he said. Rather, once scientists use the whole-genome chips to find 
particular genes that are associated with, say, tumor aggressiveness or drug effectiveness, he 
said, they will then make smaller and cheaper chips containing just those genes for use in 
diagnosis. 
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Agilent Technologies ships whole human genome on single 
microarray to gene expression customers for evaluation 

Company to introduce first commercial whole human microarray by end of year 
PALO ALTO, Calif., Oct. 2, 2003 
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Agilent Technologies Inc. (NYSE: A) today announced it has shipped whole human-genome microarrays 
to customers for testing and evaluation. The whole genome microarray is based on Agilent's new double- 
density format, which can accommodate 44,000 features on a single 1" x 3" glass-slide microarray The 
new platform enables drug-discovery and disease researchers to perform whole-genome screening at a 
lower cost and with higher reproducibility. y 

This is an important step toward our release of the first whole human-genome microarray product which 
is expected to be available for order before the end of the year," said Barney Saunders, vie* president 
and general manager of Agilent's BioResearch Solutions Unit. - Customers have long wanted a one- 
sample one-chip format with the increased sensitivity associated with 60-mer probes. The cost savings 
and high-quality performance make this product a compelling alternative for scientists who make their 
own microarrays. 

AgHenfs microarrays are based on the industry-standard 1" x 3" (25mm x 75mm) format, which is 
compatible with most commeraal microarray scanners. All Agilent commercial microarrays are developed 
using content from public databases and proprietary sources, with full sequence and annotation 
information made available to customers. Gene sequences for probes are developed using algorithms 
and then validated emp.ncally through iterative wet-lab testing procedures. The result is a microarray 
compnsed of functionally validated probes, with the most up-to-date and comprehensive genome 
information commercially available. 



Advantages of the double-density format include: 



• Lower cost. Not only is one microarray less expensive than two, it requires fewer reagents and 
reduces instrumentation demands. 

• Streamlined workflow. Researchers need prepare and process only one microarray instead of 
two. This also results in fewer steps in the subsequent data analysis. 

• Greater reproducibility. Use of a single microarray further reduces unnecessary variability in 
experimental conditions. 

• Smaller sample use. A smaller quantity of sample material is required to perform an experiment. 
Availability 

Agilent's Whole Human Genome Microarray is expected to be available for order by the end of the year. 



About Agilent T chnologi s 

Agilent Technologies Inc. (NYSE: A) is a global technology leader in communications, electronics life 
sciences and chemical analysis. The company's 30,000 employees serve customers in more than 110 
countnes. Agilent had net revenue of $6 billion in fiscal year 2002. Information about Agilent is available 
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on the Web at www.aQilent.oom . 



Forward-L oklng Statements 

This news release contains forward-looking statements (including, without limitation, statements relating 
to Agilent* s expectation that its whole-genome microarray platform will be available for order before the 
end of 2003) that involve risks and uncertainties that could cause results to differ materially from 
management's current expectations. These and other risks are detailed in the company's filings with the 
Securities and Exchange Commission, including its Annual Report on Form 10-K for the year ended Oct 
31, 2002, its Quarterly Report on Form 10-Q for the quarter ended July 31, 2003 and its Current Report 
on Form 8-K filed Aug. 18, 2003. The company assumes no obligation to update the information in this 
press release. 
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'* Affymetrix Announces Commercial Launch of Single Array for Human Genome 
Expression Analysis 

AFFYMETRIX GENECHIP(R) BRAND HUMAN GENOME U133 PLUS 2.0 ARRAY 

Affymetrix GeneChip(R) Brand Human Genome U133 Plus 2 0 Arrav 
(PRNewsFoto)[AS] y * 
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More Than 1 Million Probes Analyze Expression Levels of Nearly 50,000 RNA 
Transcripts and Variants on a Single Array the Size of a Thumbnail 

SANTA CLARA, Calif., Oct. 2 /PRNewswire/ — Affymetrix, Inc., 
(Nasdaq: AFFX) announced today that it is taking orders for its new 
GeneChip(R) brand Human Genome U133 Plus 2.0 Array, offering researchers the 
protein-coding content of the human genome on a single commercially available 
catalog microarray. The HG-U133 Plus 2.0 Array analyzes the expression level 
of nearly 50,000 RNA transcripts and variants with 22 different probes per 
transcript, providing superior data quality unmatched by technologies using a 
single probe per transcript. 

(Photo: http://www.newscom.com/cgi-bin/prnh/20031002/SFTH021 ) 
"With about 1.3 million probes on a chip the size of a human thumbnail, 
the Human Plus Array represents a leap in array technology data capacity, and 
further .demonstrates the unique power and potential of our technology to 
explore vast areas of the genome," said Trevor J . Nicholls, Ph.D., Chief 
Commercial Officer. "Multiple independent measurements for each transcript 
ensure that our data quality remains the industry standard, even as our data 
capacity increases dramatically." 

The HG-U133 Plus 2.0 Array, which will ship in October, combines the 
content of the previous HG-U133 two-array set with nearly 10,000 new probe 
sets representing about 6,500 new genes, for a total of nearly 50,000 RNA 
transcripts and variants. This new information, verified against the latest 
version of the publicly available genome map, provides researchers the most 
comprehensive and up-to-date genome-wide gene expression analysis. The probe 
design strategy of the HG-U133 Plus 2.0 Array is identical to the previous HG- 
17133 Set, providing very strong data concordance between the two products. 
With more than double the data capacity of the previous -generation Affymetrix 
human product, the HG-U133 Plus 2.0 Array can significantly cut processing and 
analysis time for scientists in the lab, freeing up valuable resources and 
accelerating research. 

The HG-U133 Plus 2.0 Array sets a new standard for the number of genes and 
transcripts on any commercially available single array for human gene 
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, expression analysis, while maintaining Affymetrix' unrivaled data oualitv Th* 
HG-U133 Plus 2.0 Array uses 22 independent measures to detect the 
hybridization of each transcript on the array, 1.3 million data points in all 

t more than 30 times that of any other microarray technology. Using multiple 

independent measurements provides optimal sensitivity and specificity and' the 
most accurate, consistent and statistically significant results possible 

"More data points produce more reliable results and ultimately, enable 
better science," said Nicholls. "Our powerful probe set strategy gives our 
customers the assurance that their array results actually reflect what's in 
their sample." 

Affymetrix is also launching an updated 11 -micron version of its popular 
18-micron HG-U133A Array called the GeneChip HG-U133A 2.0 Array. The reduced 
feature size on this new design means researchers can use smaller sample 
volumes than on the previous 18-micron array without compromising performance 
This new array represents over 20,000 transcripts that can be used to explore 
human biology and disease processes. All probe sets represented on the 
SS-SSS 2 e rSray HG " U133A identicall y replicated on the GeneChip 

More information on the design of the HG-U133 Plus 2.0 Array and the 
HG-U133A 2.0 Array may be found on the Affymetrix website at 
http://www.affymetrix.com . 

Affymetrix will be presenting further information on this and other 
products at the BioTechnica trade show in Hanover, Germany on Oct. 7-9, 2003 
The Company will also hold a press conference on Oct. 7, from 11 a m to 
12 P.m- at the show regarding the new Human Genome U133 Plus 2.0 Array If you 
would like to attend this press conference, please contact Caroline S^upnicS 
at c.stupnicka@northbankcommunications.com. 

About Affymetrix: 

Affymetrix is a pioneer in creating breakthrough tools that are drivinq 
the genomic revolution By applying the principles of semiconductor technology 
to the life sciences, Affymetrix develops and commercializes systems that 
enable scientists to improve the quality of life. The Company's customers 
include pharmaceutical, biotechnology, agrichemical , diagnostics and consumer 
products companies as well as academic, government and other non-profit 
research institutes. Affymetrix offers an expanding portfolio of integrated 
products and services including its integrated GeneChip platform, to address 
growing markets focused on understanding the relationship between genes and 
human health. Additional information on Affymetrix can be found at 
http://www.affymetrix.com . 

All statements in this press release that are not historical are 
"forward-looking statements- within the meaning of Section 21E of the 
Securities Exchange Act as amended, including statements regarding Affymetrix' 
"expectations," "beliefs," "hopes," "intentions," "strategies" or the Uke 
Such statements are subject to risks and uncertainties that could cause actual 
results to differ materially for Affymetrix from those projected, including, 
but not limited to risks of the Company's ability to achieve and sustain 
higher levels of revenue, higher gross margins, reduced operating expenses 
uncertainties relating to technological approaches, manufacturing, product 
development, market acceptance (including uncertainties relating to product 
development and market acceptance of the GeneChip HG-U133 Human Plus 2.0 Array 
and the HG-U133A 2.0), personnel retention, uncertainties related to cost 117 
pricing of Affymetrix products, dependence on collaborative partners 
uncertainties relating to sole source suppliers, uncertainties relating to FDA 
and other regulatory approvals, competition, risks relating to intellectual 
property of others and the uncertainties of patent protection and litigation. 
These and other risk factors are discussed in Affymetrix' Form 10-K for the 
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year ended December 31, 2002 and other SEC reports, including its Quarterly 
Reports on Form 10 -Q for subsequent quarterly periods. Affymftrix exoressW 
disclaims any obligation or undertaking to release puilicirany^nd^ ^ 
revisions to any forward-looking statements contained herein to reject alv 
change « Asymetrix' expectations with regard thereto or any change £ * 
events, conditions, or circumstances on which any such statements Ire based. 

NOTE: Affyrnetrix, the Affyrnetrix logo, and GeneChip and are reqi stered 
trademarks owned or used by Affymetrix, inc. registered 



SOURCE Affyrnetrix, inc. 

Web Site: http://www.affymetrix.com 
Photo Notes: NewsCom: 

http://www.newscom.com/cgi-bin/prnh/20031002/SFTH021 AP Archive- 
http://photoarchive.ap.org PRN Photo Desk, 

photodesk@prnewswire.com 

conTnt ° f re ' eaSeS PR NeWSWire are SO,e 'y ~Pon.ibl. for the accuracy of the 

More n ws from PR Newswire... 

Copyright © 1996.2002 PR Newswire Association LLC. All Rights Reserved 
A United Business Media company. 
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Macroresults through Microarrays 

John C. Rockett, Reproductive Toxicology Division (MD-72), National Health and Environmental Effects Research Laboratory, Office of Research 
and Development, US Environmental Protection Agency, Research Triangle Park, 2525 East Highway 54, Durham, NC 2771 1, USA; 
tel: +1 919 541 2071, fax: +1 919 541 4017, e-mail: rockett.john@epa.gov 



The third enactment of Cambridge 
Healthtech Institute's Macroresults 
through Microarrays meeting was held 
in Boston (MA, USA) from 29 April- 
1 May 2002. The subtheme of this year's 
meeting was 'advancing drug discov- 
ery', a widely touted application for 
array technology. 

The evolution of microarrays 
If you were asked 'Who first conceived - 
of the idea of microarrays', who woujd 
come to mind? Mark Schena perhaps, 
first author of the seminal 1995 paper 
on cDNA arrays [1]? Maybe Pat Brown, 
Schena's then supervisor? Or perhaps 
Stephen Fodor, the primary driver 
behind Affymetrix's (http://www. 
affymetrix.com) oligonucfeotide-based 
platform [2]. Brits might even chant the 
name of Ed Southern [3]. Well, accord- 
ing to Roger Ekins (University College 
London Medical School; http://www. 
ucl.ac.uk/medicine/) all these answers 
would be wrong. It was in fact Ekins 
and his colleagues who first conceived 
of and patented 'a new generation of 
ultrasensitive, miniaturized assays for 
protein and DNA-RNA measurement 
based on the use of microarrays' in the 
mid 1980s [4]. The concept and poten- 
tial of array technology was more fully 
described in a later publication, in 
which Ekins et al. [5] concluded that an- 
tibody microspots of -50 um 2 could be 
achieved, and that as many as 2 million 
different immunoassays could, in prin- 
ciple, be accommodated on a surface 
area of 1 cm 2 . 

Technological innovation 

In practice, it took a different biological 

molecule (DNA), a different research 



group, and a leap into microfabri- 
cation technology to even begin 
approaching these kinds of densities 
[Affymetrix patent 6045996 talks of 
one million spots cm- 2 ]. Of course, 
advancing technology is one of the 
driving engines behind the genomics 
juggernaut, and we are already seeing 
'4th generation' machines for fab- 
- ricating DNA chips. If the company 
representatives at this meeting are to 
be believed (and their cases seemed 
strong), spotting is out, and in situ 
fabrication of oligonucleotide-based 
'iterative custom arrays' is in. Whether 
you go with the Combimatrix's (http:// 
www.combimatrix.com) electrochemi- 
cally directed synthesis and detection 
system, febit's (http://www.febit.com) 
Geniom® technology, or Nimblegen's 
(http://www.nimblegen.com) Maskless 
Array Synthesizer technology is a 
matter of personal choice. However, 
each of these machines provides the 
flexibility to design variable length 
oligonucleotide probes from se- 
quences inputted by the user, and then 
perform in situ synthesis of an array. 
Each system also boasts unique advan- 
tages. For example, Combimatrix's 
biological array processor is a semi- 
conductor coated with a 3D layer 
of porous material in which DNA, 
RNA, peptides or small molecules 
can be synthesized or immobilized 
within discrete test sites, while febit's 
Ceniom One® is a fully integrated 
gene-expression analysis system with 
minimal user hands-on time - the 
probe sequences are programmed, the 
RNA samples inserted, and the gene 
expression data is pumped ut a few 
hours later. 



Cell- and tissue-based arrays 
Array technology is in most people's 
minds firmly linked with gene-expression 
profiling. Fewer are aware that cell- and 
tissue-based arrays have been devel- 
oped, and how they can provide 
a vital extra dimension to research. In 
support of this, Barry Bochner gave an 
update on the cell-based array system 
that Biolog (http://www.biolog.com) 
has produced for simultaneously mea- 
suring the effects of one gene in the cell 
under thousands of growth conditions 
(see [6] for further details). David Walt 
(Tufts University; http:// www. tufts, 
edu/) is developing single live cell ar- 
rays using optical imaging fiber (OIF) 
technology. An array of microwells is 
fabricated on the face of an OIF at den- 
sities of up to 10 million wells cm- 2 . 
Cells are then added to the wells and 
disperse at an average of one cell per 
well. Physiological and genetic re- 
sponses of each cell are measured via 
fluorescence produced by reporter 
genes (e.g. /ocZ, gfp. Assays performed 
so far include yeast live or dead cell 
assay, microenvironment pH and 
0 2 measurements, promotor responses 
using the lacZ and phoA reporter genes, 
and protein-protein interactions using 
the yeast two-hybrid system. The main 
advantage of this system is that the cells 
remain alive during the assay, which 
means a real-time timecourse can be 
performed and/or the array passed 
from sample to sample. This would be 
useful in, for example, the scanning of 
a combinatorial drug library for specific 
physiological effects. 

Tissue arrays are a useful complemen- 
tary technology to DNA arrays because 
they can be used to help validate and 
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understand the biological and medical 
significance of gene changes discov- 
ered using standard DNA arrays. For 
example, an array of tumor tissues can 
be screened for the protein (using im- 
munohistochemistry), message (using 
in situ hybridization) and copy number 
(using comparative genomic hybridiza- 
tion) of a gene of interest, to determine 
if expression of the gene (or lack 
thereof) is related in any way to sur- 
vival. They can also be used to predict 
the probability of clinical failure of lead 
compounds as a result of toxicity by 
evaluating the distribution of the drug 
targets in normal tissue. Spyro Mousses 
and his co-workers at the National 
Human Genome Research Institute 
(http://www.nhgri.nih.gov/index.html) 
have built such arrays, including a 
multi-tumor array (-5000 specimens, 
and sections from 36 normal and 800 
metastatic tissues) and a normal tissue 
array (76 tissue and 332 cell types). 

The problem with proteins 
It has been said that genomics tells us 
what might happen, transcriptomics 
indicates what should happen, and pro- 
teomics shows what is happening. The 
impact of functional proteomics on 
pharmaceutical R&D is rapidly increas- 
ing, and protein arrays are being used 
increasingly in both basic and applied 
research. Their use lies not only in com- 
parative protein expression and inter- 
action profiling, but also in diagnostics 
and drug discovery. However, an in- 
creasing number of researchers have 
found that protein arrays, like their 
cousins the DNA arrays, present several 
practical obstacles relating to their pro- 
duction and use. For example, in using 
Escherichia coli to produce recombi- 
nant eukaryotic proteins from a single 
expression vector, multiple protein 
products are often produced, suggest- 
ing mixes of truncated or otherwise 
altered proteins. There is also the obvi- 
ous concern that the proteins might 
not be modified in a similar manner to 



eukaryotic systems. Also, an optimal 
method for depositing and binding 
proteins to the selected substrate is 
yet to be determined, as is the best 
way to ensure that they are bound in a 
correctly folded, active conformation. 

Several companies have been address- 
ing these problems. Prolinx (http:// 
www.proltnxinc.com) is one such com- 
pany, and Karin Hughes described their 
Versalinx™ chemistry for producing 
protein, peptide and small-molecule 
arrays. Versalinx™ uses solution-phase 
conjugation followed by immobiliza- 
tion, resulting in functional orientation 
of proteins and peptides on the sub- 
strate surface. It also offers the valuable 
additional benefit of exhibiting low 
non-specific binding. Sense Proteomic 
(http://www.senseproteomic.com) is 
also among those addressing these 
problems to develop robust protein 
arrays for drug discovery and clinical 
applications and has developed func- 
tional protein array formats based on 
specific disease tissues. Subtractive hy- 
bridization is used to identify genes 
with altered expression in breast tumor 
and cystic fibrosis compared to normal 
tissue. A high throughput cloning strat- 
egy (COVET™) is then used to produce 
libraries of genes that are tagged, 
cloned, expressed, purified and finally 
immobilized on glass slides. Initial vali- 
dation studies have shown that the vast 
majority of the immobilized proteins do 
indeed retain biological function. 

Stefan Schmidt and his company 
(CPC Biotech; http://www.gpcbiotech. 
de) have moved past the platform devel- 
opment stage and, with their focus 
firmly on drug discovery, are currently 
developing kinase-profiling arrays. 
Kinases are important targets for phar- 
maceutical drug discovery and therapy, 
and CPC's aim is to simultaneously de- 
tect multiple kinases, obtain activity pro- 
files for different cell types, or analyze 
the ability of drug candidates to inhibit 
kinase activity. To do this, recombinant 
kinase substrates are immobilized on 



membranes, incubated with purified 
kinase, and the* substrates measured for 
the degree of ph sphorylation. 

Summary 

Meetings like this, packed with exciting 
discoveries and intriguing and interest- 
ing innovation, heavily emphasize the 
pace at which biotechnology is advanc- 
ing, to the extent that the number of 
options for genomic and proteomic re- 
searchers can become overwhelming. 
Although data analysis is perhaps the 
greatest current concern for array users, 
an increasing challenge will be to deter- 
mine the approaches and technology 
that really work, and to do it in a timely 
manner. 
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A standard two-dimensionaJ (2-D) protein map of Fischer 344 rat liver 
(F344MST3) is presented, with a tabular listing of more than 1200 protein species. 
Sodium dodecyl sulfate (SDS) molecular mass and isoelectric point have been es- 
tablished, based on positions of numerous internal standards. This map has been 
used to connect and compare hundreds of 2-D gels of rat liver samples from a va- 
riety of studies, and forms the nucleus of an expanding database describing rat 
liver proteins and their regulation by various drugs and toxic agents. An example 
of such a study, involving regulation of cholesterol synthesis by cholesterol-lower- 
ing drugs and a high-cholesterol diet, is presented. Since the map has been ob- 
tained with a widely used and highly reproducible 2-D gel system (the Iso-Dalt* 
system), it can be directly related to an expanding body of work in other laborato- 
ries. 
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1 Introduction 

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by O'Farrell and others [1-4], has 
been used over the ensuing 16 years to examine a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products [5,6]. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in vitro systems are ideal for answer- 
ing many experimental questions, especially in cancer re- 
search and genetics, our experience with cell cultures and 
tissue samples suggests that some in vivo approaches could 
have major advantages. In particular, we have noticed that 
liver tissue samples from rats and mice appear-to show grea- 
ter quantitative reproducibility (in terms of individual pro- 
tein expression) than replicate cell cultures. This is perhaps 
a natural result of the homeostasis maintained in a com- 
plete animal vs. the well-known variability of cell cultures, 
the latter due principally to differences in reagents (e.g., 
fetal bovine serum), conditions (e.g., pH) and genetic a evo- 
lution" of cell lines while in culture. It is also more difficult 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached cells), forcing the inves- 
tigator to resort to radioisotope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain typically used for protein detec- 
tion in "large" protein samples, they are generally more vari- 
able, more labor-intensive and, in the case of radiographic 
methods, may generate highly "noisy" images, due to the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-40 (NP-40) solubilization and stained with CBB, which 
has the advantage of being easily reproducible [8]. Finally, 
there remains the question of the "truthfulness* of many in 
vitro systems as compared to their in vivo analogs; how 
great are the changes caused by the introduction into a cul- 
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ture and the associated shift to strong selection for growth, 
and how do these affect experimental outcomes? Hence 
the apparent advantages of fa vitro systems, in i terms of ex- 
perimental manipulation, may be counterbalanced by 
other factors relating to 2-D data quality. 

There is a second important class of reasons for exploring 
the use of an fa v/vo biological system such as the liver. His- 
orically there have been two broad approaches to the me- 
chaS c dissection of biochemical processes in intact eel- 
ula? ystems: genetics (a search for informative mutants) 
SI the use of chemical agents (drugs and chemical toxins). 
Both approaches help us to understand complex systems 
by disrupting some specific functional element and show- 
inVus the result. With the development of techniques for 
eenetic manipulation and cloning, the genetic approach 
ran be effectively applied either fa vitro or fa v/vo, although 
the in vitro route is usually quicker. The chemical approach 
can also be applied to eithersort of biological system; here, 
however the bulk of consistently acquired information is 
in experimental animals (rats and mice). While most biolo ; 
cists know a short list of compounds having specific, experi- 
mentally useful effects (*.*., inhibitors of protein synthesis, 
ionophores, polymerase inhibitors, channel blockers, nu- 
cleotide analogs, and compounds affecting polymerization 
of cytoskeletaJ proteins), there is a much larger number of 
interesting chemically-induced effects, most of them char- 
acterized by toxicologists and pharmacologists in rodent 
cynems. Just as a thorough genetic analysis would involve 
saturating a genome with mutations, it is possible to ima- 
gine a saturating number of drugs, the analysis of whose ac- 
tions would reveal the complete biochemistry of the cell. 
While organized drug discovery efforts usually target spe- 
cific desired effects, the nature of the process, with its de- 
pendence on screening large numbers of compounds, ne- 
cessarily produces many unanticipated effects. It is there- 
fore reasonable to suppose that the required broad range of 
compounds necessary to achieve "biochemical saturation 
may be forthcoming; in facl.it may already exist among the 
hundreds of thousands of compounds that failed to qualify 
as drugs. 

Among organs, the liver is an obvious choice for the study 
of chemical effects because of its well-known plasticity and 
responsiveness. The brain appears to be quite plastic {e.g. 
[7]) but it is a complicated mixture of cell types requiring 
skillful dissection for most experiments. The kidney, while 
quite responsive, also presents a potentially confounding 
mixture of cell types. The liver, by contrast, is made up of 
one predominant cell type which is easy to solubilize: the 
hepatocyte, representing more than 95% of its mass. Most 
importantly, the liver performs many homeostatic func- 
tions that Tequire rapid modulation of gene expression.lt 
appears that most chemical agents tested affect gene ex- 
pression in the liver at some dosage (N. Leigh Anderson, 
unpublished observations), an interesting contrast to our 
earlier work with lymphocytes, for example, which seem to 
be much less responsive. Such results conform to the expec- 
tation that cells with a homeostatic, physiological role 
should be more plastic than cells differentiated for a pur- 
pose dependent on the action of a limited number of spe- 
cific genes. 

The liver also allows the parallels between in vitro and in 
vivo systems to be examined in detail. Significant progress 
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has been made in the development of mouse, rat and hu- 
man hepatocyte culture systems, as well as in precision-cut 
tissue slices. Using such an array of techniques, it is possi- 
ble to assemble a matrix of mammalian systems including 
mouse and rat in vivo on one level and mouse, rat and hu- 
man in vitro on a second level, and to compare effects be- 
tween species and between systems. This approach allows 
us to draw informed conclusions regarding the biochemical 
"universality" of biological responses among the mammals, 
and to offer some insight into the validity of in vitro ap- 
proaches for toxicological screening. We believe this data 
will be necessary if in vitro alternatives are to achieve wide 
usage in government-mandated safety testing of drugs, con- 
sumer products and industrial and agricultural chemicals. 

A number of interesting studies have been published using 
2-D mapping to examine effects in the rodent liver. A num- 
ber of investigarors have made use of the technique to 
screen for existing genetic variants [8-1 1J or induced muta- 
tions 112-14], mainly in the mouse. This work builds on the 
wealth of genetic information available on the mouse and 
its established position as a mammalian mutation-detec- 
tion system. While some studies of chemical effects have 
been undertaken in the mouse [15-17], most have used the 
rat [18-23]. The examination of the cytochrome p-450 sys- 
tem, in particular, has been carried out almost exclusively 
on the rat [24, 25]. 

These considerations lead us to conclude that rodent liver 
offers the best opportunity to systematically examine an 
array of gene regulation systems, and ultimately to build a 
predictive model of large-scale mammalian gene control. 
The basic underlying foundation of such a project is a reli- 
able, reproducible master 2-D pattern of liver, to which on- 
going experimental results can be referred. In this paper, we 
report such a master pattern for the acidic and neutral pro- 
teins of rat liver (pattern F344MST3).ln future,this master 
will be supplemented by maps of basic proteins, and analog- 
ous maps of mouse and human liver. 



2 Materials and methods 
2.1 Sample preparation 

Liver is an ideal sample material for most biochemical stud- 
ies, including 2-D analysis. A sample is taken of approxima- 
tely 0.5 g of tissue from the apical end of the left lobe of the 
liver. Solubilization is effected as rapidly as practical; a 
delay of 5-15 min appears to cause no major alteration in 
liver protein composition if the liver pieces are kept cold 
{e.g., on ice) in the interim. In the solubilization process, 
the liver sample is weighed, placed in a glass homogenizer 
{e.g., 15 mLWheaton); 8 volumes of solubilizing solution 

• The solubilizing solution is composed or 2% NP-40 (Sigma), 9 m urea 
(analytical grade, e.g., BDH or Bio-Rad), 0.5% dithiothrcitol (DTT; 
Sigma) and 2% carrier ampholytes (pH 9-11 LKB: these come as a20% 
stock solution, so 2 % final concentration is achieved by making the final 
solution 10% 9-11 Ampholine by volume). A large batch orsolubihier 
(several hundred mL) is made and stored frozen at -80°C in aliquots 
sufficient to provide enough Tor one day's estimated sample prepara- 
tion requirement. The solution is never allowed to become warmer 
than room temperature at any stage during preparation or thawing lor 
use. since heating of concentrated urea solutions can produce conumi- 
nants that covalenlly modify proteins producing artiraciual charge 
shifts. Once thawed, any unused solubiliier is discarded. 
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is added (i.e., 4 mLper 0.5 g tissue) end the mixture is ho- 
mogenized using first the loose- and then then the tight-fit- 
ting glass pestle. This takes approximately 5 strokes with 
each pestle and is carried out at room temperature because 
urea would crystallize out in the cold. Once the liver sample 
is thoroughly homogenized in the solubitoerjt is assumed 
that all tbeproteins are denatured (by the chaotropic effect 
of the urea and NP-40 detergent) and the enzymes inacti- 
vated by the high pH (-9.5). Therefore these samples may 
be kept at room temperature until they can be centrifuged 
or frozen as s group (within several hours of preparation). 
The samples are centrifuged for 6 X 10* £ min (e.g., 500 000 
X g for 12 min using a Beckman TL-100 centrifuge). The 
centrifuge rotor is maintained at just below room tempera- 
ture (e.g., 15-20°C), but not too cold, so as to prevent the 
precipitation of urea. The centrifuge of choice is a Beckman 
TL-100 because of the sample lube sizes available, but any 
ultracentrifuge accepting smallish tubes wilJ suffice. When 
an appropriate centrifuge is not available near the site of 
sample preparation, samples can be frozen at -S0°C and 
thawed prior to centrifugation and collection of superna- 
tants.Each supernatant is carefully removed following cen- 
trifugation and aliquoted into at least A clean tubes forstor- 
age.This is done by transferring aJl the supernatant to one 
clean tube, mixing this gently (to assure homogeneous 
composition) and then dividing it into A aliquots.Tbe ali- 
quots are frozen immediately at -S0°C. These multiple ali- 
quots can provide insurance against a failed run or a freezer 
breakdown. 

2.2 Two-dimensional electrophoresis 

Sample proteins are resolved by 2-D electrophoresis using 
the 20 X 25 cm Iso-Dalt* 2-D gel system ([26-29]; pro- 
duced by LSB and by Hoefer Scientific Instruments, San 
Francisco) operating with 20 gels per batch. All first-dimen- 
sional isoelectric focusing (IEF) gels are prepared using the 
same single standardized batch of carrier ampholytes 
(BDH 4-8A in the present case, selected by LSB's batch- 
testing program for rat and mouse database work**). A 10 
uL sample of solubilized liver protein is applied to each gel, 
and the gels are run for 33 000 to 34 500 volt-hours using a 
progressively increasing voltage protocol implemented by 
a programmable high-voltage power supply. An Ange- 
lique" computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
ium dodecyl sulfate (SDS) polyacrylamide gradient slab 
gels in which the top 5 % of the gel is 1 1 %T acrylamide, and 
the lower 95% of the gel varies linearly from 11% to 18%T 

This system has recently been modified so as to employ a 
commercially available 30.8 %T acrylamide///, TV-methyle- 
nebisacrylamide prepared solution (thus avoiding the han- 
dling of the solid acrylamide monomer) and three addi- 
tional stock solutions: buffer (made from Sigma pre-set 
Tris), persulfale and MA^AMetramethylethylenedi- 
araine (TEMED). Each, gel is identified by a computer- 
printed filter paper label polymerized into the lower left cor- 
ner of the gel. First-dimensional IEF tube gels are loaded 



" This material (succeeding certified batches of which are available from 
Hoefer Scientific Instruments) has the most linear pH gradient pro- 
duced by any ampholyte tested except for the Pharmacia wide range 
(which has an unacceptable tendency to bind high-molecular weight 
acidic proteins, causing them lo streak). 



directly (as extruded) onto the slab gels without equilibra- 
tion, and held in place by polyester fabric wedges (Wed- 
gies", produced by LSB) to avoid the use of hot agarose. 
Second-dimensional slab gels are run overnight, in groups 
of 20, in cooled DALT tanks (10 e C) with buffer circulation. 
All run parameters, reagent source and lot information, 
and notations of deviation from expected results are ente- 
red by the technician responsible on a detailed, multi-page 
record of the experiment. 

23 Staining 

Following SDS-electrophoresis, slab gels are stained for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1 L of gel) per box. This procedure (based on the work 
of NeuhoJT[30,31]) involves fixation in 1.5 L of 50% etha- 
nol and 2% phosphoric acid for 2h, three 30 min washes, 
each in 2 L of cold tap water, and transfer to 1.5 L of 34% 
methanol, 17% ammonium sulfate and 2 % phosphoric acid 
for 1 h, followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon gels are 
transferred to cool tap water and their surfaces rinsed to re- 
move any particulate stain prior to scanning. Gels may be 
kept for several months in water with added sodium azide. 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncolloidal, with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is diluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows: (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometry; (ii) up to 1500 spots can be reli- 
ably delected on many gels (e.g., rat liver) at loadings low 
enough to preserve excellent resolution; and (in) reprodu- 
cibility appears to be very good: at least several hundred 
spots have coefficients of reproducibility less than 15%. 
This value is at least as good as previous CBB methods, and 
significantly better than many silver stain systems. 

2.4 Positional standardization 

The carbamylated rabbit muscle creatine phosphokinase 
(CPK) standards [32] are purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
taken from the Protein Identification Resource (PIR) se- 
quence database [33]. 



2.5 Computer analysis 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
print prepared from the raw digital image as hard-copy 
backup of the gel image. Gels are processed using the Kep- 
ler* software system (produced by LSB), a commercially 
available workstation-based software package built on 
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some of the principles of the earlier TYCHO system [34- 
4]]. Procedure PROC008 is used to yield a spotiist giving 
position, shape and density information for each detected 
spot. This procedure males use of digital filtering, mathe- 
matical morphology techniques and digital masking to re- 
move the-background, and uses full 2-D least-squares opti- 
mization to refine the parameters of a 2-D Gaussian shape 
for each spot. Processing parameters and file locations are 
stored in a relational database, while various log files detail- 
ing operation of the automatic analysis software are ar- 
chived with the reduced daxa.The computed resolution and 
level of Gaussian convergence of each gel are inspected 
and archived for quality control purposes. 

Experiment packages are constructed using the Kepler ex- 
periment definition database to assemble groups of 2-D 
patterns corresponding to the experimental groups (e.g., 
treated and control animals). Each 2-D pattern is matched 
to the appropriate "master" 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
providing linkage to the existing rodent protein 2-D data- 
bases. The software allows experiments containing hun- 
dreds of gels to be constructed and analyzed as a unit, with 
up to 300 gels displayed on the screen at one time for com- 
parative purposes and multiple pages to accommodate ex- 
periments of > 1000 gels. For each treatment, proteins 
showing significant quantitative differences vj. appropriate 
controls are selected using group-wise statistical parame- 
ters (e.g., Student's t-test, Kepler* procedure STUDENT). 
Proteins satisfying various quantitative criteria (such as P< 
0.001 difference from appropriate controls) are repre- 
sented as highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations (i.e., logi- 
cal vectors) in a liver protein database. Quantitative data 
(spot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analysis of co- 
regulation is performed using a Pierson product-moment 
correlation (Kepler procedure CORREL) to determine 
whether groups of proteins are coordinately regulated by 
any of the treatments. Such groups can be presented graphi- 
cally on a protein map,and reported together with the statis- 
tical criteria used to assess the level of coregulation. Multi- 
variate statistical analysis (e.g., principal components 1 ana- 
lysis) is performed on data exported to SAS (SAS Institute). 

2.6 Graphical data output 

Graphical results are prepared in GKS and translated 
within Kepler* into output for any of a variety of devices. 
Linedrawing output is typically prepared as Postscript and 
printed on an Apple LaserWriter. Detailed maps presented 
here have been generated using an ultra-high-resolution 
Postscript-compatible Linotronic output device. Greyscale 
graphics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the standard 
orientation, with high molecular mass at the top and acidic 
proteins to the left. 

2.7 Experiment LSBC04 

In the study described here 12-week-old Charles River 
male F344 rats were used. Diets were prepared at LSB, 
based on a Purina 5755M Basal Purified Diet. Lovastatin 
and cholestyramine were obtained as prescription pharma- 



ceuticals, ground and mixed with the diet at concentrations 
of 0.075 % and 1 %, respectively. The high cholesterol diet 
was Purina 5801M-A (5% cholesterol plus 1 % sodium cho- 
late in the control diet). Animal work was carried out by Mi- 
crobiological Associates (Bethesda,MD). Animals were ac- 
climatized for one week on the control diet, fed test or con- 
trol diets for one week, and sacrificed on day 8. Average 
daily doses of lovastatin and cholestyramine in appropriate 
groups were 37 rog/kg/day and 5 g/kg/day, respectively, 
based on the weight of the food consumed. Liver samples 
were collected and prepared for2-D electrophoresis accord- 
ing to the standard liver protocol (homogenization in 8 
volumes of 9 m urea, 2% NP-40, 0.5% dithiothreitol, 2% 
LKB pH 9-11 carrier ampholytes, followed by centrifuga- 
tion for 30 min at 80000 X g). Kidney, brain and plasma 
samples were frozen. Gels were run as described above, 
and the data was analyzed using the Kepler* system. Gels 
were scaled, to remove the effect of differences in protein 
loading, by setting the summed abundances of a large num- 
ber of matched spots equal for each gel (linear scaling). 



3 Results and discussion 

3.1 The rat liver protein 2-D map 

F344MST3 is a standard 2-D pattern of rat liver proteins, 
based on the Fischer 344 strain. This pattern was initiated 
from a single 2-D gel and extensively edited in an experi- 
ment comparing it to a range of protein loads, so as to in- 
clude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D patterns 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) have 
been added as a result. A modified version including addi- 
tional spots present in the Sprague-Dawley outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 10 pLof solubi- 
lized liver protein prepared by the standard method and 
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear in the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant, Fig. 4 the upper right (basic, 
high molecular mass) quadrant, Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units), isoelectric point (relative to the CPK in- 
ternal p/standards) and SDS molecularmass (from the cali- 
bration curve in Fig. 8) are listed for each spot (Table 1). Be- 
cause of the precision of the CPK-p7 values, these parame- 
ters can be used to relate spot locations between gel sys- 
tems more reliably than using pi measurements expressed 
as pH. A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pro- 
teins, as well as rigorous definitions of subcellular rga- 
nelle contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse liver 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results of these studies will be 
presented systematically in a later edition of this database, 
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but we include here a useful series of 22 orienting identifi- 
cations as an aid to other users of the rat liver pattern (Table 
2). 

32 Carbamylaied charge standards, computed p/s and 
molecular mass standardization 

We have previously shown that the use of a system of close- 
ly-spaced internal p/ markers (made by carbamylating a 
basic protein) offers an accurate and workable solution to 
the problem of assigning positions in the p/ dimension [32]. 
The same system, based on 36 protein species made by car- 
bamylating rabbit muscle CPK, has been used here to as- 
sign p/s to most rat liver acidic and neutral proteins. The 
standards were coelectrophoresed with total liver proteins, 
and the standard spots added to a special version of the 
master pattern F344MST3. The gel ^-coordinates of all 
liver protein spots lying within the CPK charge train were 
then transformed into CPK p/ positions by interpolation 
between the positions of immediately adjacent standards 
(Table 1) using a Kepler* vector procedure. 

It has proven possible to compute fairly accurate pi values 
for many proteins from the amino acid composition [42]. 
We have attempted here to test a further elaboration of this 
approach, in which we computed p/s for the CPK standards 
themselves, based on our knowledge of the rabbit muscle 
CPK sequence and the fact that adjacent members of the 
charge train typically differ by blockage of one additional ly- 
sine residue (Table 3). We compared these values to similar 
computed p/s for an additional set of carbamylated stand- 
ards made from human hemoglobin beta chains and a se- 
ries of rat liver and human plasma proteins of known posi- 
tion and sequence (Fig. 7,Table 4).The result demonstrates 
good concordance between these systems. Two proteins 
show significant deviations: liver fatty-acid binding protein 
(FABP; #1 in Table 4) and protein disulphide isomerase 
(*20 in the table). The FABP spot present on F344MST3 
may represent a charge-modified version of a more basic 
parent spot closer to the expected p/, not resolved in the 
1EF/SDS gel. Of particular importance is the fact that, by 
comparing computed p/s of sequenced but unlocated pro- 
teins with the CPK p/s, we can assign a probable gel loca- 
tion without making any assumptions regarding the actual 
gel pH gradient. This offers a useful shortcut, given the va- 
garies of pH measurement on small diameter IEF gels. We 
have used this approach to compute the CPK p/s of all rat 
and mouse proteins in the P1R sequence database, as an aid 
to protein identification (data not shown). 

In orderto standardize SDS molecularweight (SDS-MW), 
we have used a standard curve fitted to a series of identified 
proteins (Fig. 8). Rather than using molecular mass perse, 
we have elected to use the number of amino acids in the 
polypeptide chain, as perhaps a better indication of the 
length of the SDS-coated rod that is sieved by the second 
dimension slab. The resulting values were multiplied by 
312 (the weighted average mass of amino acids in se- 
quenced proteins) to give predicted molecular masses. Be- 
cause we use gradient slabs, we have not constrained the fit- 
ted curve to conform to any predetermined model; rather 
we tried many equations and selected the best using the 
program "Tab I ecu rve" on a PC. The equation chosen wasy 
= a + bx+ c/x\ where .k is the number of residues, x is the gel 
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resulting fit appears to be fairly good over a broad range of 
molecular mass. 

33 An example of rat liver gene regulation: Cholesterol 
metabolism 

Experiment LSBC04 was designed as a small-scale test of 
the regulation of cholesterol metabolism in vivo by three 
agents included in the diet: lovastatin (Mevacor*,an inhibi- 
tor of HMG-CoA reductase); cholestyramine (a bile acid 
sequestrant that has the effect of removing cholesterol 
from the gut-liver recirculation); and cholesterol itself. The 
first two agents should lower available cholesterol and the 
third should raise it, allowing manipulation of relevant 
gene expression control systems in both directions. Such 
an experiment offers an interesting test of the 2-D mapping 
system since most of the pathway enzymes are present in 
low abundance, many are membrane-bound and difficult 
to solubilize, and the pathway itself is complex. Approxima- 
tely 1000 proteins were separated and delected in liver ho- 
mogenates. Twenty-one proteins were found to be afTected 
by at least one treatment, and these could be divided into 
several coregulated groups. 

3.3.1 MSN 413 (putative cytosolic HMG-CoA synthase) 
and sets of spots regulated coordinated or inversely 

One group of spots (including a spot assigned to the cyto- 
solic HMG-CoA synthase, MSN 4 13) showed the expected 
increase in abundance with lovastatin or cholestyramine, 
the synergistic further increase with lovastatin and choles- 
tyramine, and a dramatic decrease with the high cholesterol 
diet. Spot number 413 is the most strongly regulated pro- 
tein in the present experiment, showing a 5- to 10-fold in- 
duction after a 1 week treatment with 0.075% lovastatin and 
1 % cholestyramine in the diet (Figs. 9 and 10). Its expres- 
sion follows precisely the expectation for an enzyme whose 
abundance is controlled by the cholesterol level; it is pro- 
gressively increased from the control levels by cholestyra- 
mine, lovastatin and lovastatin plus cholestyramine, and it 
sinks below the threshold of detection in animals fed the 
high cholesterol diet. This spot has been tentatively identi- 
fied as the cytosolic HMG-CoA synthase, based on a reac- 
tion with an antiserum to that protein provided by Dr. Mi- 
chael Greenspan at Merck Sharp & Dohme Research Labo- 
ratories. This enzyme lies immediately before HMG-CoA 
reductase in the liver cholesterol biosynthesis pathway,and 
is known to be co-regulated with it. Spot 413 has an SDS 
molecular weight of about 54 000 and a CPK p/of-11.4,in 
reasonably close agreement with a molecular weight of 
57300 and a CPK p/of -15.7 computed from the known se- 
quence of the hamster enzyme [43]. 

Using a classical product-moment correlation test (Kepler 
procedure CORREL), a series of five additional spots was 
found to be coregulated with 413. The level f correlation 
was exceedingly high (> 95%). Two of these, 1250 and 933, 
are at similar molecular weights and approximately one 
charge more acidic than 413 (Fig. 9), indicating that they 
may be covalently modified forms of the 413 polypeptide. 
This suspicion is strengthened by the observation that both 
spots are also stained by the antibody to cytosolic HMG- 
CoA synthase. The remaining three correlated spots appear 
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to comprise an additional related pair (1253 and 1001) of 
around 40 kDa and a single spot (1119) of around 28 kDa. 
Because these two presumed proteins are present at sub- 
stantially lower abundances than 413, and because the cyto- 
solic HMG-CoA synthase is reported to consist of only one 
tvpe of polypeptide, they are likely to represent other, very 
tightly coregulated enzymes. A second group of six spots 
was selected based on a regulatory' pattern close to the in- 
verse of that for spot 413 (MSN's 34,79, 178, 381 204,347; 
data not shown). For these proteins, the lowest level of ex- 
pression occurs with exposure to lov £S tatin plus cholestyra- 
mine and the highest level upon exposure to the high-cho- 
lesterol diet. Spots 182 and 79 are highly correlated and he 
about one charge apart at the same molecular weight; ihey 
may thus be isoforms of a single protein. The other four 
spots probably represent additional enzymes or subunits. 

3 32 MSN 235 and coregulated spots 

A third group of five spots, mainly comprised of mitochon- 
drial proteins including putative mitochondrial HMG- 
CoA synthase spots, showed a modest induction by lovasta- 
tin alone, but little or no effect with any of the other treat- 
ments (including the combination of lovastatin and choles- 
tyramine; Fig. 12). This result is intriguing because lovasta- 
tin was expected to affect only the regulation of enzymes of 
cholesterol synthesis, which is entirely exxns-miiochon- 
drial. Three of the spots (235, 134, 144) form a closely- 
packed triad at approximately 30 kDa, and are likely to re- 
present isoforms of one protein. All three spots are stained 
by an antibody to the mitochondria] form of HMG-CoA 
synthase obtained from Dr. Greenspan. Subcellular fractio- 
nation indicates a mitochondrial location. The other two 
spots (633 at about 38 kDa and 724 at about 69 kDa) are 
each present at lower abundance than the members of the 
triad. 



proteins of the putative mitochondrial pathway are so 
much more variable in their expression in all groups. An ex- 
amination of all the coregulated groups suggests that quan- 
titative siatistical techniques can extract a wealth of inter- 
esting information from large sets of reproducible gels.The 
abundance of spots in the 4 13 coregulation group, for exam- 
ple, shows an amazing level of concordance in their relative 
expression among the five individuals of the lovastatin and 
cholestyramine treatment group. This effect is not due to 
differences in total protein loading, since they have already 
been removed by scaling, and since proteins with quite dif- 
ferent regulation patterns can be demonstrated (e.g.. Fig. 
13). Such effects raise the possibility that many gene coregu- 
lation sets may be revealed through the study of a suffi- 
ciently large population of control animals (i.e., without 
any experimental manipulation). This approach, exploiting 
natural biological variation in protein expression instead of 
drug effects, offers an important incentive for the construc- 
tion of a large library of control animal patterns. 



4 Conclusions 

Because of the widespread use of rat liver in both basic bio- 
chemistry and in toxicology, there is a long-term need for a 
comprehensive database of liver proteins. The rat liver mas- 
ter pattern presented here has proven to be an accurate re- 
presentation of this system, having been matched to more 
than 700 gels to date. As the number of proteins identified 
and the number of compounds tested for gene expression 
effects grows, we expect this database to contribute valu- 
able insights into gene regulation. Its practical utility in sev- 
eral areas of mechanistic toxicology is already being de- 
monstrated. 
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333 An example of an anti-synergistic effect 

A sixth spot (367) shows strong induction by lovastatin 
(two- to threefold), and about half as much induction with 
lovastatin plus cholestyramine, but without sharing the ani- 
mal-animaJ heterogeneity pattern of the 235-set (Fig. 13). 
This protein is also mitochondrial, and represents the clear- 
est example of an anti-synergistic effect of lovastatin and 
cholestyramine. The existence of such an effect demon- 
strates that lovastatin and cholestyramine do not act exclu- 
sively through the same regulatory pathway. 

33.4 Complexity of the cholesterol synthesis pathway 

Taken together, these results suggest that treatment with lo- 
vastatin alone can affect both cytosolic and mitochondrial 
pathways using HMG-CoA, while cholestyramine, on the 
other hand, either alone or in combination with lovastatin, 
produces a strong effect on the putative cytosolic pathway, 
but little or no effect on the putative mitochondrial path- 
way. An explanation for this difference may lie in lovasta- 
tin's effect on levels of HMG-CoA and related precursor 
compounds that are exchanged between the cytosol and 
the mitochondrion, whereas cholestyramine should affect 
only the cytosolic pathways directly controlled by cholester- 
ol and bile acid levels. It remains to be explained why some 



5 References 

[I] O'Farrell, P.,/ Biol. Chem. 1975. 250, 4007-4021. 
[2] KJose, J., Humongenetik 1975 . 26, 231-243. 

[3] Scheele, G. A.. J. Biol. Chcm. 1975, 250, 5375-5385. 
|4] Iborra, G. and Buhler, J. M., Anal. Biochem. 1976, 74 ', 503-511. 
(5) Anderson, N. G. and Anderson, N. U, Bchring. Inst. Mitt. 1979, 6*5, 
169-210. 

[6] Anderson,N.G.and Anderson,N.L., din. Chcm. 1982,25,739-748. 

[7] Heydorn, W. E., Creed, G. J. and Jacobowitz, D. M., J. Pharmacol. 
Exp. Thcrap. 1984, 229, 622-628. 

[8] Anderson,N. L.. Nance, S.L..Tollaksen,S.L.,Gicre,F. A. and Ander- 
son, N. G., Electrophoresis 1985, 6; 592-599. 

[9] Racine, R. R. and Langley, C. H., Biochem. Genet. 1980, 18, 185-197. 
[10] Klose, J., Mol. Evol. 1982, 18, 315-328. 

[I I] N eel, J. V., Baier, L.. Hanash, S. and Erickson, R. P.,/. Hered. 1985, 76, 
314-320. 

[32] Marshall, R.R.,Raj, A. S.,Granl,F. J. and Heddle,J.A.,Cfln.y.Cfn«. 
Cytol. 1983, 25, 457-446. 

[13] Taylor, J., Anderson, N.L., Anderson, N.G.,Gemmell,A.,GiomcUi, 
C. S., Nance, S. L. and Tollaksen, S. L., in: Dunn, M.J. (Ed. ) f Electro- 
phoresis 'S6,Verlag Chemie, Weinheim 1986, pp.. 583-587. 

[14] Giometti, C. S., Gemmell, M. A., Nance. S. UTollaksen, S. L. and 
Taylor, J., J. Biol. Chem. 1987 , 262, 12764-12767. 

|15] Anderson, N. L., Giere, F. A., Nance, S. L., Gemmell, M. A., Tollak- 
sen, S. L. and Anderson, N. G., in: Galtcau, M.-M. and Siest, G. 
(Eds.), Progres Recents en Electrophorcse BidimensioneUc, PresstS 
Universitaires de Nancy, Nancy 1986, pp. 253-260. 

[16] Anderson, N. L., Swanson, M., Giere, F. A., Tollaksen, S., Gemmell, 
A., Nance, S. L. and Anderson, N. G., Electrophoresis 1986, 7, 44-48. 



electrophoresis 1991. />. 907-930 



Diubuc of ret liver proteins 913 



[17) Anderson. N. L, Giere, F- A., Nance, S. L., Gemmell, M. A.,Tollak- 
sen, S.L.and Anderson, N. G.,Fundcm.Appl. Toxicol. 1987, 5,39-50. 

[18J Anderson. N.L., in : Sew Horizons in Toxicology, Eli Lilly Symposium, 
1991, in press. 

[19] Anioine, B., Rahimi-Pour, A., Siest, G., Magdalou, J. and Galteau, 

M. M., Cell. Biochem. Fvncu 1987, J, 217-231. 
[20] Elliott. B. M., Ramasamy, R-, Stonard, M. D. and Spragg, S. P. Bio- 

chim. Biophys. Acio 19E6, 810, 135-140. 
[2 1] Huber, B. E., Hcilman, C. A., Winn, P. J., Miller, M.J. and Thorgeirs- 

son, S. S., Htpciolozy 1986, 6, 209-219. 
[22] Winh, P.J. and Vesierbcrg, 0., Electrophoresis 1988, P, 47-53. 
[23] Wiumann, F. A. and Parker, D. N., Toxicol. Lett. 1991, 57. 29-36. 
[24] Rampersaud, A., Waxman, D. J., Ryan, D. E., Levin, W. and Wal2, F. 

C.ir., Arch. Biochem. Biophys. 1985, 243. 174-183. 
[25] Vlasuk, G.P.and VJh\z.T.G..U.,Ancl. Biochem. 1980, JOS. 112-120. 
[26] Anderson, N.G. and Anderson, N.L.,Anol. Biochem. 1S7S, £5,331- 

340. 

[27] Anderson, N.L. and Anderson, f*.G.,Anal. Biochem. 1978, £5.341- 
354. 

[28] Anderson, I~. Hofmann. J.-P., Anderson, E., Walker, B. and Ander- 
son, N. G., in: Endler, A. T. and Hanash, S. (Eds.), Two- Dimensional 
Electrophoresis, VCH Verlagsgesellschafl, Weinheim 1989, pp. 288- 
297. 

[29] Anderson, L., Two-Dimensional Electrophoresis: Operation of the 
JSO-DAL7* System. Large Scale Biology Press, Washington, DC 
1988, ISBN 0-945532-OO-S, 170pp. 

[30] NeuhofT,V.,Siamm,R.and£ibl,H.,£/«/rop/io/'ej/j 1985,6,427-448. 



[31] NeuhoiT,V.,Arold,N.,Taube,D.and Ehrhardt, W„ Electrophores u 
1988, 9, 255-262. 

[32] Acderson,N.L.andHickman,B.J.,>4nc/.fi/ocAeni.l979,W,3l2-320. 

[33] Sidman, K. E., George, D. E., Barker, W. C and Hunt, L. T- NvcL 
Acids Res. 1988, 76, 1S69— 1871. 

[34] Taylor, J., Anderson, N. L.. Coulter. B. P., Scandora, A. E. and Ander- 
son, N.G. , in: Radola,B J. (Ed. ),£/«fro/>Aor«w 79.de Gruyier, Ber- 
lin 1980, pp. 329-339. 

[35) Taylor, J., Anderson, N. L. and Anderson, N, G., in: AJJen, R. C. and 
* Amaud, P. (Eds.), Electrophoresis '8 J, de Gruyier, Berlin 1981, pp. 
383-400. 

[36] Anderson, N.L.,Taylor,J M Scancora,A.E.,Coulter 1 B.P.and Ander- 
son, N. G., Clin. Chem. 1981, 27, 1807-1820. 

[37] TaylorJ..Anderson,N.L.,Scandora,A.E.,Jr„Wiilard,K.E.and An- 
derson, N. G., Clin. Chem. 1982, 28, £61-866. 

[3 8 J Ta y lor, J. . Ande rson. N. L. and Anderson, N.G. , Electrophoresis 1983, 
4, 338-345. 

[39] Anderson, N. L. and Taylor, J., in: Proceedings of the Fourth Annual 
Conference and Exposition of the Actional Computer Graphics Associa- 
tion, Chicago, June 26-30, 1983, pp.' 69-76. 

[401 Anderson, N. L„ Hofmann, J.-P., Gemmell, A. and Taylor, J., Clin. 
Chem. 1984, 30, 2031-2036. 

[411 Anderson, L.. in: S chafe r-Nielsen.C( Ed. ), Electrophoresis 'W.VCH 
Verlagsgesellschafi, Weinheim 1988. pp. 313-321. 

[42] Neidhardt, F. C, Appleby, D. A.. Sankar, P., Hution, M. E. and Phil- 
lips, T. A., Electrophoresis 1989. 10, 116-121. 

[43] Gil, G., Goldstein, J. L., Slaughter, C. A. and Brown, M. S., J. Biol. 
Chem. 1986, 261, 3710-3716. 



914 L Andenon r f « 

6 Addendum 1: Figures 1-13 



Electrophoresis 1991. /^.90T-9J0 




Figure J. Synthetic representation of the standard rat liver 2-D master pattern, rendered as a greyscale image using a videoprinter. 
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Figure 2. Schematic representation of the master pattern (the same as Fig. 1), useful as an aid in relating specific areas of Fig. 1 and the following detailed 
quadrants. 
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/V*«« 3. Upper left (high molecular weight, acidic) quadrant (#1) of the rai liver map, showing spot numbers. 
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Figure 8. Plot of number of amino acids versus gel X-posiiion, with fitted 
curve used to predict molecular mass of unidentified proteins. 
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4 Figure 7. (a) Plot of computed isoelectric point versus gel JT-positfon for 
two sets of carbamylated standard proteins (rabbit muscle CPK [+] and 
human hemoglobin 0 chain, filled diamonds) and several other proteins 
(shaded squares), (b) The identities of the various proteins represented 
by the squares are indicated by the numbers in corresponding positions 
on (a); these refer to Table 4. 
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Figure P. Montage showing effects in the 
region of MSN :4 13. The montage shows a 
small window into one portion of the 2*D 
pattern, one row of windows for each expe- 
rimental group, and one panel for each gel 
in the experiment. The left-most pattern 
in each row is a group-specific copy of the 
master pattern followed by the patterns 
for the five individual rats in the group. 
The highlighted protein spots (filled circ- 
les) are spot 4 13 (on the right of each pan- 
el; identified as cytosolic HMG-CoA syn- 
thase) and two modified forms of it (1250 
and 933). From the top, the rows (experi- 
mental groups) are: high cholesterol, con- 
trols, cholestyramine, lovastatin, and lova- 
statin plus cholestyramine. 
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Figure 10, Bargraph showing the quantita- 
tive effects of various treatments on the 
abundance of MSN:4U (cytosolic HMG- 
CoA synthase) in the gets of Fig. 9. 




Figure U. Bargraphs of a series of six core- 
gulatcd spots including MSN:413. In the 
bargraphs, the abundances of the appro- 
priate spot (master spot number shown at 
the top of the panel) in each animal are 
shown. The five five-animal groups are in 
the order (left to right): high cholesterol, 
controls, cholestyramine, lovastatin, and 
lovastatin plus cholestyramine. Each bar 
within a group represents one experimen- 
tal animal liver (one 2-D gel). Note the cor- 
related expression of tbe 6 spots, espe- 
cially in the two far right (most strongly in- 



duced) groups. 
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Figure 12. Data on a second coregulatcd 
group of spots, presented as in Fig. 1 l.The 
fourth experimental group (lovastatin) 
shows a modest induction, while the fifth 
group (lovastatin plus cholestyramine) 



does not. 




Figure IS, Data on spot MSN:367, presented as in Fig. 11. This protein 
shows unambiguously the anti-synergistic effect of lovastatin and choles- 
tyramine (fifth group) as compared to lovastatin (fourth group). This res- 
ponse contrasts strongly with the regulation pattern seen in Fig. 11. 
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Table 1. Master table or proteins in the rat liver database" 
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1318 
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72 
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73 
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74 
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75 
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76 
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-1.0 


74.500 


158 


1275 
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83 
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-0.9 


89.100 


164 


1566 


806 


87 


516 


1371 


-27.0 
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-3.5 
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91 
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92 
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93 
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-4.6 
-4.8 
-2.4 
<35.0 
-6.7 
-5.7 
-22.1 
>0.0 
>0.0 
-10.7 
-6.9 
-25.7 
•2.8 
•7.9 
•4.5 
-2.1 
-13.5 
-11.4 
>0,0 
-6.1 
-7.8 
-2.6 
-11.4 
>0.0 
-11.6 
•3.8 
•0.2 
-7.0 
•4.6 
-7.0 
>0.0 
-16.3 
-28.7 
-13.7 



53.800 
40.700 
51,600 
51.700 
25,000 
53,700 
47,900 
61,300 
37,300 
23.800 
26.100 
56,100 
42,500 
38.300 
49.700 
55.500 
43,500 
44.500 
160,800 
34,100 
48.700 
36,500 
50,800 
37,400 
65.200 
42,900 
15.30C 
13.900 
36,000 
33,500 
42,600 
86,100 
37.30C 
57.000 
40,700 
53,800 
29.700 
36,000 
16,800 
28.100 
37,700 
43.700 
43,200 
40.700 
15,800 
33,800 
77,900 
29.800 
51,600 
55.300 
26,500 
50,800 
13.700 
40.500 
117,000 
33,900 
62.100 
56.600 
91,400 
44,400 
162.400 
65,900 
37,800 
54,600 
40,000 
13,700 
38.400 
51,700 
164.900 
50,400 
44.700 
53.500 
71,800 
32.100 
19.300 



MSN 


x 


v 

T 




SOSMW 


174 


1364 


ton 




162,900 


175 


825 




-15.7 


69,300 


177 


> JOfc 


dad 


-3.6 


52,600 


178 


1 t*C 1 


I iw 


-7.2 


43,000 


179 


1089 


CI K 


-10.4 


48.300 


180 


i no 


5o7 


-0.5 


51.600 


lOl 


411 


295 


-32.1 


91,200 


1A9 
iO* 


004 


730 


-16.2 


42.000 


184 


1860 


896 


-0.6 


34.500 


185 


1997 


1017 


>0.0 


29,800 


186 


279 


1113 


<-35.0 


26.300 


187 


773 


296 


-17.0 


90,800 


188 


1538 


807 


-4.2 


38,400 


191 


1560 


674 


-3.9 


44,900 


192 


1818 


687 


-0.9 


44,200 


193 


1469 


555 


-5.0 


52.400 


194 


1380 


266 


-6.4 


101,600 


195 


784 


632 


-16.7 


47,300 


196 


1227 


1185 


-8.4 


23.700 


197 


667 


553 


-20.1 


52.600 


198 


2006 


681 


>0.0 


44.500 


199 


1711 


674 


-2.2 


44.900 


200 


872 


424 


-14.7 


65,000 


201 


292 


435 


<-35.0 


63.700 


202 


736 


253 


-18.0 


107,800 


203 


786 


829 


-16.7 


37.400 


204 


1224 


569 


-8.5 


50.000 


205 


439 


983 


-30.9 


31,100 


206 


1994 


571 


>0.0 


51.300 


207 


1895 


687 


-0.3 


44.200 


208 


240 


1416 


<-35.0 


15.800 


210 


1700 


499 


•2.3 


57,000 


211 


902 


517 


-14.1 


55.400 


213 


1067 


684 


-10.4 


44,400 


214 


1340 


668 


-7.0 


45,200 


215 


1591 


495 


-3.5 


57.300 


216 


1585 


755 


-3.6 


40.700 


217 


1159 


393 


-9.3 


69.300 


218 


931 


572 


-13.5 


51.200 


219 


713 


177 


-18.7 


170.500 


220 


1479 


911 


-4.9 


33.900 


221 


965 


927 


-12.8 


33.300 


223 


934 


716 


-13.5 


42,700 


225 


1812 


1045 


-1.0 


28.800 


226 


821 


411 


-15.8 


66,800 


227 


1586 


1483 


•3.6 


13,600 


228 


1065 


567 


-10.8 


51.600 


229 


1577 


890 


-3.7 


34.800 


230 


1458 


496 


-5.2 


57,300 


232 


1440 


849 


-5.5 


36.500 


234 


1692 


489 


-2.4 


57.900 


235 


618 


1004 


-22.0 


30,300 


236 


920 


1138 


-13.7 


25,400 


237 


952 


1008 


-13.1 


30.200 


238 


1611 


541 


-3.2 


53.500 


239 


1489 


720 


-4.8 


42.500 


240 


501 


448 


-27.7 


62.100 


241 


1820 


569 


•0.9 


51,400 


242 


1357 


658 


-6.8 


45,800 


243 


711 


1182 


-18.7 


23,800 


244 


1855 


621 


-0.6 


48.000 


245 


1189 


474 


-8.9 


59.300 


246 


551 


459 


-25.1 


61,000 


247 


1348 


604 


-6.9 


49.100 


248 


460 


448 


-29.3 


62.100 


249 


1733 


451 


-1.9 


61.800 


250 


1974 


788 


>0.0 


39.200 


251 


808 


392 


-16.1 


69.500 


252 


874 


553 


-14.6 


52.500 


253 


753 


848 


-17.6 


36.500 


254 


995 


450 


-12.1 


61.900 


255 


1690 


679 


•2.4 


44.600 


256 


994 


1006 


-12.1 


30.200 


257 


508 


464 


-27.4 


60.400 


258 


1517 


820 


-4 4 


37.800 



a) Master table of proteins in the rat liver database, showing spot master number, gel position {x and y), isoelectric point relative to CPK standards and 
predicted molecular mass (from the standard curve of Fig. 8). 
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MSN 


X 


Y 


CPKDl 


SDSMW 


259 


1796 


961 


-1.1 


31.900 


260 


661 


1361 


•20.4 


17,700 


cv 1 


1725 


679 


•2.0 


44,600 


5fi9 

CM 


496 


1127 


-28.0 


25.800 


263 


1063 


172 


-10.9 


177.400 


965 

COS 


1390 


673 


-6.3 


45,000 




510 


437 


-27.3 


63.400 


9fi7 
CO/ 




1038 


•20.4 


29,000 


«uO 


430 


961 


-31.0 


31.900 


9CG 
ZW 


4 ai4 


606 


-11.2 


48,900 


2/U 




853 


>0.0 


36,300 


271 




422 


-15.0 


65,200 


Z7Z 


HQS 


966 


-14.2 


31,700 


2/4 


1 909 


712 


•7.6 


42,900 


275 


1 J9V 


590 


-6.9 


49,900 


276 


« C7A 


1069 


-2.6 


27,100 


277 


boo 


MO 


•19.4 


53,700 


278 


961 


7111 
/1C 


-13.0 


42.600 


279 


879 


CTft 

570 


-14.5 


51 300 


261 


1846 


lUo4 


-0.7 


27 300 


282 


4 g/\C 

1505 


5Z9 


-4.6 


54 800 


283 


1313 


1147 


•/ .j 


9*. Iftft 

O, 1 UV 


284 


1314 


oZS 


-7 a 

*/ .J 


37 4/Vi 

J / ,*UW 


285 




4O0 


-7 1 


K7 9O0 

O /,CwU 


Zoo 


iZ/ / 


CM 
ODZ 


-7 fi 


AjX 1ftA 
1 UU 




IJ9I 


824 


-6.3 


37 600 


ZOV 


1147 
1 1*/ 


STB 

9/9 


-9.5 


50 700 






911 


-13.6 


55 900 


291 


lot 


147C 

14/0 


•16.6 


1 1 qaa 


292 


4 ^IC9> 
|4oZ 


OlO 


-5.1 


^.7 firm 
j / .ouu 


293 


Ml 


440 


-26.3 


fi? ono 


294 


860 


©90 


-14.9 


1"» CAA 


295 


1 102 


Owv 


-9.3 


■*C, / UU 


296 


216 


814 


<-J9.U 


4D atv* 
Jc.uuu 


297 


1377 


ATA 


£T C 
**3.9 


^1 ^AA 
Ol ,JUU 


299 


913 


4 C44 

15Z3 


-1^ Q. 


1 *,4UU 


300 


2012 


667 


"*ft ft 

>u.u 


49,JUu 


301 


702 


4 fO 

178 


.10 ft 




302 


494 


4 OOA 

IZov 


.00 4 




303 


403 


4 nna 
lUUO 


•ot.o 


1ft 1ftft 


304 


1843 


4 gag 
1 509 


^.7 


1ft Wl 


305 


1049 


583 


-ii i 




306 


1606 


989 


~i i 
-j.j 


Jv t VUU 


307 


1219 


91 0 


-0.9 


JO, AW 


308 


1627 


755 


J ft 




4ao 

JUS 


i C44 

1 524 


BUZ 


-4.4 


TOO 


J1D 


4 ?CA 
1 /09 


1UZD 


-1 .5 


9Q 400 
*5r ,4\PU 


*1 1 

Jl 1 


icao 


14*1 
1491 


-3.3 


14 700 






14ftA 

1 4W 


<-35.0 


16 100 


aia 


10A9 
1 9UC 


1365 


-0.3 


17,600 


314 


1316 


1395 


.7.3 


16,600 


315 


1341 


523 


-7.0 


54 ,900 


316 


1104 


1053 


-10.1 


26,500 


320 


1480 


1459 


-4.9 


14,400 


3.91 

OC I 


850 


603 


-15.1 


49,100 


^99 

JC« 


1454 


1494 


•5.3 


13,300 


jcj 


Oris 




-20.0 


47,700 


JZ4 


099 


101 
1 VI 


-20.6 


420 500 


JZ5 


1 C91 
1 9*1 


R7* 
0/9 


-4.4 


AA OOft 

44,(W 


oA 


1 30/ 


1577 
O/ / 


-3.6 


44 TOO 


3Z/ 


l Joe 


4AQ 
4U9 


-6.3 


fi7 ftf» 


•a oft 


44fi 


1901 


. -30.0 


20,100 


"WO 


1 AOA 
1 DUO 


751 

/9 1 


-3.3 


40,900 


JJ 1 


1 tec 


697 


-3.6 


43,700 


332 


531 


471 


-26.3 


59,600 


333 


784 


1156 


-16.7 


24,700 


334 


1059 


407 


-10.9 


67,300 


335 


1593 


303 


-3.5 


88.500 


336 


1616 


598 


•3.2 


49,400 


336 


1854 


1004 


-0.6 


30,300 


339 


1265 


888 


4.0 


34.900 


340 


581 


585 


-23.6 


50.300 


341 


1497 


1047 


-4.7 


26.700 


343 


1351 


265 


-6.8 


102.200 


344 


1813 


549 


-0.9 


52.600 



MSN 


X 


Y 


CPKpl 


SDSMW 


345 


1006 


578 


•11.9 


50,800 


346 


1095 


640 


-10.3 


46,800 


347 


625 


728 


-21.7 


42.000 


348 


361 


963 


-35.3 


31,100 


349 


110 


1343 


<-35.0 


16,300 


350 


521 


1130 


-26.7 


25,700 


351 


912 


619 


-13.9 


46,100 


352 


1574 


530 


^3.7 


54,300 


353 


961 


912 


-12.9 


33,900 


354 


706 


762 


-18.9 


40,400 


355 


1450 


830 


-5.3 


37 300 


356 


1374 


1 1 £9 




&4,94AS 


357 


474 


997 


-98 7 


30 600 


358 


796 


34fi 


.4 C 4 

-10. J 


77 ftftfl 


359 


764 


000 


-17 ^ 

- f / ,j 


7B ADO 

/9,4%M 


360 


1384 


1 wo 


^.4 


97 QOO 


V51 


171 1 
1/10 


7RO 


9 4 

-Z.I 


Aft 1 SY^ 




1 1£1 


ACQ 
099 


.O ** 


JO.l W 


363 




1 1 


"1 J.O 


94 noo 


364 


^19 


409 


^9 ft 


c *> 7ftfl 
OJ,/VA^ 


365 


7^1 
/4 1 


AAA 
400 


17 O 

-1 /.W 


CO 90ft 




0/0 


19UJ 


1i ft 


1 *3 ftftft 


367 


1 sw 




0 


•in noo 

JJ,MA^ 




VOJ 


^90 


19 4 


CC 900 


•ICQ 


*J*t 


441 


-J 1 ,u 


cq nnn 


370 


639 


610 


91 9 


4A 700 
40, /^v 


^71 


1 30/ 


oov 


4 C 
-J.O 


^c i fVl 
JO.lUU 


^79 


1 0/D 


/oZ 


-0.9 


4U.4UU 


J/o 


i J51 


1 059 


-6.8 


28,300 


J/4 


1 5VX) 


71b 


-4.6 


42,700 


J /5 


1823 


532 


•0.9 


54,200 


0/6 


254 


417 


<-35.0 


65,900 


377 


1409 


583 


-6.1 


50,400 


378 


621 


494 


-21.8 


57.500 


379 


1017 


595 


-11.7 


49.600 


381 


953 


598 


-13.1 


49.400 


382 


856 


674 


-15.0 


44,900 


383 


1252 


258 


^8.1 


105,300 


J04 


1 coo 


1 51 8 


•2.3 


12,500 


IPC 


1fU4 


493 


-11.2 


57,500 


•lac 




583 


-4.7 


50,400 


^B7 
JO/ 


1954 


603 


■4.0 


49.100 


4Cfl 

JOO 




404 


•8.9 


67,700 


JW 


1374 


902 


-6.5 


34.300 


JW 


4 ^C£ 


969 


-5.2 


31,700 


Jot 


/lO 


con 


teg 
-18.5 


44,000 




1 TOO 


/JZ 


-1 .1 


A 4 AAA 

41 ,VUU 


Jo J 


1 AR9 
14Cc 


/5o 


^.8 


4U,OUU 


394 


1 997 
l &&/ 


1401 




AA Aftft 


395 


1530 


3/ / 


■4. J 


CA BAA 


396 


1 410 


7CC 
/DO 




Aft BAA 


397 


912 


256 


.11 Q 


106 400 


399 


1465 


1063 


•5.0 


28 100 


400 


1473 


450 




fil QAA 


401 


1029 




.11 c 

-1 1 .9 


OC *»AA 
JUU 


403 


1516 


7cj 

/94 


A A 

^.4 


4ft RAA 
4U,OUU 


404 


1495 


CCi 


-4. / 


C9 CAA 
9«;,9UU 


4A5 


159*. 


1ftQ9 
1 Use 


-4. J 


Z/.IUU 




79^ 
/*J 


Z5Z 


-18.4 


106,000 




ccn 
CSV 


DOJ 


-20.8 


if CAA 

45,900 


41ft 


1 FAI 
1 9V1 


4/0 


A C 

-4.0 


CO AAA 


41 1 


936 


1ft^7 
I U9/ 


1 1 A 

-1 J. 4 


OB 1AA 

ZO.JUU. 


412 


350 


1 ) <v 


•J9.9 


4C AAA 


413 


1033 


C-iO 
•7JO 


1 1 J 

-1 1 .4 


C4 fftft 
9J, rUU 


415 


737 


^9« 


IB ft 


A4 Qftft 


416 


1578 


606 


-3.7 


48 900 


417 


646 


496 


-21.0 


57!30O 


418 


1695 


482 


-2.3 


58,600 


419 


725 


770 


-18.3 


40,000 


420 


1289 


1041 


-7.7 


26.900 


421 


1171 


912 


-9.1 


33.900 


422 


599 


162 


•22.6 


193,700 


423. 


929 


856 


-13.6 


36.200 


424 


739 


625 


-17.9 


47,700 


425 


1490 


965 


-4.7 


31.800 



MSN 


X 


Y 


CPKDl 


SDSMW 


426 


1296 


704 


-7.0 


43.300 


427 


810 


843 


-16.0 


36,600 


428 


1565 


303 


-3.9 




429 


1259 


847 


-6.0 


■\g CAA 


430 


1253 


562 


-8.1 


Si Qftft 


431 


734 


1426 


•18.1 


1 ^ vn 


432 


483 


433 


•26.5 


en om 

wJ,9UW 


434 


518 


1041 


-26.9 


9fi Oftfl 

«0,3Mjy 


435 


1020 


1170 


-11.6 


94 vn 


436 


1122 


196 


•9.8 


147 CAA 
14/,OUW 


437 


1870 


673 


-0 5 


jC AAA 


438 




1 109 


41 ft 

-J I ,W 


^C ?AA 


439 


86 


S47 
©*/ 




4C C#4ft 

JO ,OOw 


440 


1740 




.1 fi 

•1 .0 


C4 AMI 

93,200 


441 




1 C71 

19/1 


99 fi 
-cZ.o 


10,800 


443 


74^4 

/4J 


JJ9 


.1 7 fi 

-1 /.O 


80,100 


A AC 


OU 1 


OCO 


4C 4 
-lO.Z 


45,200 


447 


1ftCA 




4 4 4 

-1 1 .1 


33.300 


AAO 
"O 


1 94< 
1 c 49 


1 4Afl 
1 Z90 


-6.Z 


19.800 


440 


1 ^7R 

1 9/0 


4 ClC 

1 9lO 


4 9 

-j. 7 


12,600 


450 


mm 
1010 


1 AOi 

lOZl 


_A O 


29,600 


4m 


1 A04 
1Uv4 


^4A 
44U 


4 A 4 

-1 0.3 


63,100 


459 


1949 


QAO 

ouZ 


- O A 

>0.0 


38,600 


4*L1 


1CC9 

105c 


Ov4 


-2.8 


34,600 


454 


4 J/M 
14UJ 


9UU 


-o.l 


56.900 


456 


1 ^Q4 
1 J94 


7m 
/ 10 


_C 4 


42.600 


*9/ 


OAC 
W9 


4JO 


-14.0 


63,500 


494* 


1038 


581 


-11.3 


50,500 


4CA 
400 


1598 


294 


-3.4 


91,400 


461 


1528 


863 


-4.3 


35,900 


462 


1098 


1137 


-10.2 


25.400 


463 


849 


1125 


-15.2 


25.800 


464 


1814 


1072 


-0.9 


27.800 


465 


1388 


481 


-6.3 


58,700 


466 


1194 


1084 


-8.9 


27.300 


468 


577 


467 


-23.9 


60.100 


469 


1 140 


888 


-9.6 


34,900 


470 


1797 


524 


-1.1 


54.800 


A^A 
4/1 


1293 


1133 


-7.6 


25,500 


4/Z 


618 


655 


-21 .9 


46,000 


4/J 


2009 


299 


>0.0 


89.900 


A*7A 
4/4 


1 205 


215 


-8.7 


131,300 


475 


1035 


788 


-11.4 


39.200 


4/0 


160 


155 


<-35.0 


207,600 


477 
4/ / 


4451? 


1370 


-28.9 


17,400 


47fi 
4/0 


coo 


CM 

662 


•22.8 


45,600 


47Q 
4/V 


A AAO 
IUU9 


540 


-11 .8 


53,500 


4AA 
40v 


1 91 A 

1 z 10 


235 


•8.6 


117,400 


4A9 

4<3t 


QIC 
OlO 


lit 
J4o 


-15.9 


77,800 




O9J 


C74. 
O/J 


•19.3 


44,900 


485 


1606 


1ft1^ 
1U1 J 


4 4 

"3.3 


30,000 


486 


478 




•cO.O 


49,300 


487 


1025 


607 


.11C 
•1 1 .9 


48,800 


488 


1045 


1 186 


-119 
*1 1 .Z 


23,700 


489 


1609 


JUl 


4 4 

-3.3 


89,200 




77*. 
/ /9 


1 9BO 
1Z09 


4 ^ ft 
•17.0 


20,100 


491 


fiQ9 


1 7n 
1 /o 


4 0 4 


169,300 




1 lOO 


OC4 
904 


-10.2 


31,800 


4Q 4 
4aO 


1 7fift 
1 /OU 


/7o 


•1.6 


39,700 




AR9 

00£ 


Z4/ 


•14.5 


110.700 


4QC 
*5f9 


47ft 
4/U 


4 

1 Z5e 


-28.9 


21,200 


4 OA 


404 
4W4 


1436 


•28.1 


15,200 


4Q7 


QAft 


ECO 
09Z 


-12.5 


36,400 


499 


1414 


C4C 
940 


-6.0 


53,100 


500 


1234 


1 072 


-0.3 


27,800 


501 


1246 


659 


_fl O 


A C ^ftf\ 
45,/UU 


502 


824 


792 


-15.7 


39,000 


503 


1246 


1134 


-8.2 


25.500 


504 


1115 


1407 


•9.9 


16,200 


505 


1189 


391 


-8.9 


69.700 


506 


1578 


402 


-3.7 


68,000 


507 


787 


250 


•16.6 


109.000 


508 
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Tabic 3. Computed pfs of two sets of carbamylated protein standards: Rabbit muscle CPK and human 
hemoglobin (Hb) 
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Table 4. Computed pfs of some known proteins related to measured CPK pfs 
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0 Creatine phospho kinase (CPK), rabbit muscle 

1 Fatty scid-binding protein, rat hepatic 

2 b2-microglobulin, human 

3 Cartamoyl-phosphate synthase, rat 

4 Prealbumin ( serum albumin precursor), rat 

5 Serum albumin, rat 

6 Superoxid dismutase (Cu-Zn, SOD), rat 

7 Phospholipase C. phophoinositide-specific (?), rat 

8 Albumin, human 

9 Apo A-l lipoprotein, rat 

10 proApo A-l lipoprotein, human 

11 NADPH cytochrome P-450 reductase, rat 

12 Retinol binding protein, human 

13 Actin beta, rat 

14 Actin gamma, rat 

15 Apo A-l lipoprotein, human 

16 Apo A- IV lipoprotein, human 

17 Tubulin alpha, rat 

18 F 1 ATPase beta, bovine 

19 Tubulin beta, pig 

20 Protein disutphide isomerase (PDI), rat hepatic 

21 Cytochrome b5, rat 
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A two-dimensional gel database of rat liver proteins 
useful in gene regulation and drug effects studies 

A standard two-dimensional (2-D) protein map of Fischer 344 rat liver 
(F344 M ST3) is presented, with a tabular listing of more than 1200 protein species. 
Sodium dodecyl sulfate (SDS) molecular mass and isoelectric point have been es- 
tablished, based on positions of numerous internal standards. This map has been 
used to connect and compare hundreds of 2-D gels of rat liver samples from a va- 
riety of studies, and forms the nucleus of an expanding database describing rat 
liver proteins and their regulation by various drugs and toxic agents. An example 
of such a study, involving regulation of cholesterol synthesis bv cholesterol-lower- 
ing drugs and a high-cholesterol diet, is presented. Since the map has been ob- 
tained with a widely used and highly reproducible 2-D gel svstem (the Iso-DaJt ? 
system), it can be directly related to an expanding bodv of work in other laborato- 
ries. 
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1 Introduction 

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by OTarrell and others [1-4], has 
been used over the ensuing 16 years to examine a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products [5, 6]. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in vino systems are ideal for answer- 
ing many experimental questions, especially in cancer re- 
search and genetics, our experience with cell cultures and 
tissue samples suggests that some in vivo approaches could 
have major advantages. In particular, we have noticed that 
liver tissue samples from rats and mice appear.to show grea- 
ter quantitative reproducibility (in terms of individual pro- 
tein expression) than replicate cell cultures. This is perhaps 
a natural result of the homeostasis maintained in a com- 
plete animal vs. the well-known variability of cell cultures, 
the latter due principally to differences in reagents (e.g.! 
fetal bovine serum), conditions {e.g., pH) and genetic "evo- 
luiion^of cell lines while in culture. It is also more difficult 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached cells), forcing the inves- 
tigator to resort to radioisotope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain typically used for protein detec- 
tion in "large" protein samples, they are generally more vari- 
able, more labor-intensive and, in the case of radiographic 
methods, may generate highly "noisy" images, due to the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-40 (NP-40) solubilization and stained with CBB, which 
has the advantage of being easily reproducible [8). Finally, 
there remains the question of the "truthfulness" of many in 
vino systems as compared to their in vivo analogs; how 
great are the changes caused by the introduction into a cul- 
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lure and Ibe associated shift to strong selection for growth, 
and how do these affect experimental outcomes? Hence 
the apparent advantages of in vitro systems, in lerras of ex- 
perimental manipulation, may be counterbalanced by 
other factors relating to 2-D data quality. 

There is a second imponant class of reasons for exploring 
the use of an in vivo biological system such as the liver. His- 
torically, there have been two broad approaches to the me- 
chanistic dissection of biochemical processes in intact cel- 
lular systems: genetics (a search for informative mutants) 
and the use of chemical agents (drugs and chemical toxins). 
Both approaches help us to understand complex systems 
by disrupting some specific functional element and show- 
ing us the result. With the development of techniques for 
genetic manipulation and cloning, the genetic approach 
can be effectively applied either in vitro or in vivo, although 
the in vitro route is usually quicker. The chemical approach 
can also be applied to either sort of biological system; here, 
however, the bulk of consistently acquired information is 
in experimental animals (rats and mice). While most biolo- 
gists knowa short list of compounds having specific, experi- 
mentally useful effects (e.g., inhibitors of protein synthesis, 
ionophores, polymerase inhibitors, channel blockers, nu- 
cleotide analogs, and compounds affecting polymerization 
of cytoskeletal proteins), there is a much larger number of 
interesting chemically-induced effects, most of them char- 
acterized by toxicologists and pharmacologists in rodent 
systems. Just as a thorough genetic analysis would involve 
saturating a genome with mutations, it is possible to ima- 
gine a saturating number of drugs, the analysis of whose ac- 
tions would reveal the complete biochemistry of the cell. 
While organized drug discovery efforts usually target spe- 
cific desired effects, the nature of the process, with its de- 
pendence on screening large numbers of compounds, ne- 
cessarily produces many unanticipated effects. It is there- 
fore reasonable to suppose that the required broad range of 
compounds necessary to achieve "biochemical saturation" 
may be forthcoming; in fact, it may already exist among the 
hundreds of thousands of compounds that failed to qualify 
as drugs. 

Among organs, the liver is an obvious choice for the study 
of chemical effects because of its well-known plasticity and 
responsiveness. The brain appears to be quite plastic (e.g. 
[7]), but it is a complicated mixture of cell types requiring 
skillful dissection for most experiments. The kidney, while 
quite responsive, also presents a potentially confounding 
mixture of cell types. The liver, by contrast, is made up of 
one predominant cell type which is easy to solubilize: the 
hepatocyte, representing more than 95% of its mass. Most 
importantly, the liver performs many homeostatic func- 
tions that require rapid modulation of gene expression. It 
appears that most chemical agents tested affect gene ex- 
pression in the liver at some dosage (N. Leigh Anderson, 
unpublished observations), an interesting contrast to our 
earlier work with lymphocytes, for example, which seem to 
be much less responsive. Such results conform to the expec- 
tation that cells with a homeostatic, physiological role 
should be more plastic than ceils differentiated for a pur- 
pose dependent on the action of a limited number of spe- 
cific genes. 

The liver also allows the parallels between in vitro and in 
vivo systems to be examined in detail. Significant progress 



has been made in the development of mouse, rat and hu- 
man hepatocyte culture systems, as well as in precision-cut 
tissue slices. Using such an array of techniques, it is possi* 
ble to assemble a matrix of mammalian systems including 
mouse and rat in vivo on one level and mouse, rat and hu- 
man in vitro on a second level, and to compare effects be- 
tween species and between systems. This approach allows 
us to draw informed conclusions regarding the biochemical 
Universality" of biological responses among the mammals, 
and to offer some insight into the validity of in vitro ap- 
proaches for toxicological screening. We believe this data 
will be necessary if in vitro alternatives are to achieve wide 
usage in government-mandated safetytesting of drugs, con- 
sumer products and industrial and agricultural chemicals. 

A number of interesting studies have been published using 
2-D mapping to examine effects in the rodent liver. A num- 
ber of investigarors have made use of the technique to 
screen for existing genetic variants [8-11] or induced muta- 
tions [12-14], mainly in the mouse. This work builds on the 
wealth of genetic information available on the mouse and 
its established position as a mammalian mutation-detec- 
tion system. While some studies of chemical effects have 
been undertaken in the mouse [15-17], most have used the 
rat [18— 23]. The examination of the cytochrome p-450 sys- 
tem, in particular, has been carried out almost exclusively 
on the rat [24, 25]. 

These considerations lead us to conclude that rodent liver 
offers the best opportunity to systematically examine an 
array of gene regulation systems, and ultimately to build a 
predictive model of large-scale mammalian gene control. 
The basic underlying foundation of such a project is a reli- 
able, reproducible master 2-D pattern of liver, to which on- 
going experimental results can be referred. In this paper, we 
report such a master pattern for the acidic and neutral pro- 
teins of rat liver (pattern F344MST3). In future, this master 
will be supplemented by maps of basic proteins, and analog- 
ous maps of mouse and human liver. 



2 Materials and methods 
2.1 Sample preparation 

Liver is an ideal sample material for most biochemical stud- 
ies, including 2-D analysis. A sample is taken of approxima- 
tely 0,5 g of tissue from the apical end of the left lobe of the 
liver. Solubilization is effected as rapidly as practical; a 
delay of 5-15 min appears to cause no major alteration in 
liver protein composition if the liver pieces are kept cold 
(e.g., on ice) in the interim. In the solubilization process, 
the liver sample is weighed, placed in a glass homogenizer 
(e.g., 15 mL Wheaton); 8 volumes of solubilizing solution* 

• The solubilizing solution is composed of 2% NP-40 (Sigma), 9 m urea 
(analytical grade, e.g., BDH or Bio-Rad), 0.5% dithiothreitol (DTT; 
Sigma) and 2% carrier ampholytes (pH 9- 11 LKB: these come as a 20% 
stock solution. so 2 % final concentration isachieved by making the final 
solution 10% 9-1 1 Ampholine by volume). A large batch of solubilizer 
(several hundred mL) is made and stored Frozen at— 80°C in aliquots 
sufficient to provide enough for one day's estimated sample prepara- 
tion requirement. The solution is never allowed to become warmer 
than room temperature at any stage during preparation or thawing for 
use. since heating of concentrated urea solutions can produce contami- 
nants that covalently modify proteins producing artifactual charge 
shifts. Once thawed, any unused solubilizer is discarded. 
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is added (i.e., A ml per 0.5 g tissue) end the mixture is ho- 
mogenized using first the loose- and then then the tight-fit- 
ting glass pestle. This takes approximately 5 strokes with 
each pestle and is carried out at room temperature because 
urea would crystallize out in the cold. Once the liver sample 
is thoroughly homogenized in the solubilizer, it is assumed 
that all the proteins are denatured (by the chaotropic effect 
of ibe urea and NP-40 detergent) and the enzvmes inacti- 
vated by the high pH (-9.5). Therefore these samples may 
be kept at room temperature until they can be centrifuged 
or frozen as a group (within several hours of preparation) 
The samples are centrifuged for 6 X 10' g min (e.g., 500 000 
X g for 12 min using a Beckman TL-100 centrifuge). The 
centrifuge rotor is maintained at just below room tempera- 
ture (e.g., 15-2CTC), but not too cold, so as to prevent the 
precipitation of urea. The centrifuge of choice is a Beckman 
TL-100 because of the sample tube sizes available, but any 
ultracentrifuge accepting smallish tubes wijj suffice. When 
an appropriate centrifuge is not available near the site of 
sample preparation, samples can be frozen at -S0 c C and 
thawed prior to centrifugation and collection of superna- 
tant*. Each supernatant is carefully removed following cen- 
trifugation and aliquoted into at least 4 clean lubes for stor- 
age. This is done by transferring all the supernatant to one 
clean tube, mixing this gently (to assure homogeneous 
composition) and then dividing it into 4 aliquots. The ali- 
quots are frozen immediately at -S0°C These multiple ali- 
quots can provide insurance against a failed run or a freezer 
breakdown. 

2.2 Two-dimensional electrophoresis 

Sample proteins are resolved by 2-D electrophoresis using 
the 20 X 25 cm Iso-DaJt* 2-D geJ system ([26-29]; pro- 
duced by LSB and by Hoefer Scientific Instruments', San 
Francisco) operating with 20 gels per batch. AH firsi-dimen- 
sional isoelectric focusing (IEF) gels are prepared using the 
same single standardized batch of carrier amphoMes 
(BDH 4-SA in the present case, selected by LSB's batch- 
testing program for rat and mouse database work - *). A 10 
pL sample of solubilized liver protein is applied to each gel, 
and the gels are run for 33 000 to 34 500 volt-hours using a 
progressively increasing voltage protocol implemented by 
a programmable high-voltage power supply. An Ange- 
lique** computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
ium dodecyl sulfate (SDS) polyacrylamide gradient slab 
.gels in which the top 5 % of the gel is 1 1 %T acryiamide, and 
the lower 95% of the gel varies linearly from 1*1 % to 18%T. 

This system has recently been modified so as to employ a 
commercially available 30.8 %T acrylamide/AW-methyle- 
nebisacrylamide prepared solution (thus avoiding the han- 
dling of the solid acryiamide monomer) and three addi- 
tional stock solutions: buffer (made from Sigma pre-set 
Tris), persulfate and AW^AMetramethylethvlenedi- 
amine (TEMED). Each, gel is identified by a computer- 
printed filterpaperlabel polymerized into the lower left cor- 
ner of the gel. First-dimensional IEF tube gels are loaded 
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directly (as extruded) onto the slab gels without equilibra- 
tion, and held m place by polyester fabric wedges (Wed- 
gies , produced by LSB) to avoid the use of hot agarose 
Second-dimensional slab gels are run overnight, in groups 
of 20, id cooled DALT tanks (10'Q with bufTer circulation 
AH run parameters, reagent source and lot information 
and notations of deviation from expected results are ente- 
red by the technician responsible on a detailed, multi-pace 
record of the experiment. 

2.3 Staining 

Following SDS-electrophoresis, slab gels are stained for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1L of gel) per box. This procedure (based on the work 
ofNeuhofT[30 f 33]) involves fixation in l.SL of 50% etha- 
nol and 2% phosphoric acid for 2h, three 30 min washes, 
each in 2 L of cold tap water, and transfer to 1.5 L of 34% 
methanol, 17% ammonium sulfate and 2% phosphoric acid 
for 1 h, followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon eels are 
transferred to cool tap water and their surfaces rinsed to re- 
move any paniculate stain prior to scanning. Gels may be 
kept for several months in water with added sodium azide 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncolloidal, with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is diluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows' (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometry; (ii) up to 1500 spots can be reli- 
ably detected on many gels (e.g., rat liver) at loadings low 
enough to preserve excellent resolution; and (iii) reprodu- 
cibility appears to be very good: at least several hundred 
spots have coefficients of reproducibility less than 15% 
This value is at least as good as previous CBB methods, and" 
significantly better than many silver stain systems. 

2.4 Positional standardization 

The carbamylated rabbit muscle creatine phosphokinase 
(CPK) standards [32] are purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
taken from the Protein Identification Resource (PIR) se- 
quence database [33]. 



This material (succeeding certified batches of which are available from 
Hoefer Scientific instruments) has the most linear pH gradient pro- 
duced by any ampholyte tested except for the Pharmacia wide range 
(which has an unacceptable tendency to bind high-molecular weight 
acidic proteins, causing them to streak). 



2.5 Computer analysis 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
pnni prepared from the raw digital image as hard-copy 
backup of the gel image. Gels are processed using the Kep- 
ler* software system (produced by LSB), a commercially 
available workstation-based software package built on 
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some of the principles of the earlier TYCKO system [34- 
41]. Procedure PROC00S is used to yield i spethsi giving 
position, shape and density information for each detected 
spot. This procedure makes use of digital filtering, mathe- 
matical morphology techniques and digital masking to re- 
move the background, and uses full 2-D least-squares opti- 
mization to refine the parameters of a 2-D Gaussian shape 
for each spot. Processing parameters and file locations are 
stored in a relational datsbase.while various log files deal- 
ing operation of the automatic analysis software are ar- 
chived with the reduced cata.The computed resolution and 
level of Gaussian convergence of each gel are inspected 
and archived for quality control purposes. 

Experiment packages are constructed using the Kepler ex- 
periment definition database to assemble groups of 2-D 
patterns corresponding to the experimental groups (e.g., 
treated and control animals). Each 2-D pattern is matched 
to the appropriate -'master" 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
providing linkage to the existing rodent protein 2-D data- 
bases. The software allows experiments containing hun- 
dreds of gels to be constructed and analyzed as a unit, with 
up to 100 gels displayed on the screen at one time for com- 
parative purposes and multiple pages to accommodate ex- 
periments of > 1000 gels. For each treatment, proteins 
showing significant quantitative differences vj. appropriate 
controls are selected using group-wise statistical parame- 
ters (e.g., Student's t-test, Kepler* procedure STUDENT). 
Proteins satisfying various quantitative criteria (such as P< 
0.001 difference from appropriate controls) are repre- 
sented as highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations (i.e., logi- 
cal vectors) in a liver protein database. Quantitative data 
(spot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analysis of co- 
regulation is performed using a Pierson product-moment 
correlation (Kepler procedure CORREL) to determine 
whether groups of proteins are coordinated regulated by 
any of the treatments. Such groups can be presented graphi- 
cally on a protein map,and reported together with the statis- 
tical criteria used to assess the level of coregulation. Multi- 
variate statistical analysis (e.g., principal components' ana- 
lysis) is performed on data exported to SAS (SAS Institute). 

2.6 Graphical data output 

Graphical results are prepared in GKS and translated 
within Kepler*' into output for any of a variety of devices. 
Linedrawing output is typically prepared as Postscript and 
printed on an Apple LaserWriter. Detailed maps presented 
here have been generated using an ultra-high-resolution 
Postscript-compatible Linotronic output device. Greyscale 
graphics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the standard 
orientation, with high molecular mass at the top and acidic 
proteins to the left. 

2.7 Experiment LSBC04 

In the study described here 12-week-old Charles River 
male F344 rats were used. Diets were prepared at LSB, 
based on a Purina 5755 M Basal Purified Diet. Lovastatin 
and cholestyramine were obtained as prescription pharma- 



ceuticals, ground and mixed with the diet at concentrations 
of 0.075 % and 1%, respectively. The high cholesterol diet 
was Purina 5801M-A (5% cholesterol plus 1 % sodium cho- 
late in the control diet). Animal work was carried out by Mi- 
crobiological Associates (Betbesda,MD). Animals were ac- 
climatized for one week on the control diet, fed test or con- 
trol diets for one week, and sacrificed on day 8. Average 
daily doses of lovastatin and cholestyramine in appropriate 
groups were 37 mg/kg/day and 5 g/kg/day, respectively, 
based on the weight of the food consumed. Liver samples 
were collected and prepared for2-D electrophoresis accord- 
ing to the standard liver protocol (homogeniiation in 8 
volumes of 9 m urea, 2% NP-40, 0.5% ditbiothreitol, 2% 
LKB pH 9-11 carrier ampholytes, followed by centrifuga- 
tion for 30 min at 80 000 X g). Kidney, brain and plasma 
samples were frozen. Gels were run as described above, 
and the data was analyzed using the Kepler* system. Gels 
were scaled, to remove the effect of differences in protein 
loading, by setting the summed abundances of a large num- 
ber of matched spots equal for each gel (linear scaling). 



3 Results and discussion 

3.1 The rat liver protein 2-D map 

F344MST3 is a standard 2-D pattern of rat liver proteins, 
based on the Fischer 344 strain. This pattern was initiated 
from a single 2-D gel and extensively edited in an experi- 
ment comparing it to a range of protein loads, so as to in- 
clude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D patterns 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) have 
been added as a result. A modified version including addi- 
tional spots present in the Sprague-Dawley outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 10 uLof solubi- 
lized liver protein prepared by the standard method and 
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear in the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant. Fig. 4 the upper right (basic, 
high molecular mass) quadrant, Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units),isoelectric point (relative to the CPK in- 
ternal p/standards) and SDS molecularmass (from the cali- 
bration curve in Fig. 8) are listed for each spot (Table 1). Be- 
cause of the precision of the CPK-p/ values, these parame- 
ters can be used to relate spot locations between gel sys- 
tems more reliably than using p/ measurements expressed 
as pH . A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pro- 
teins, as well as rigorous definitions of subcellular orga- 
nelle contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse liver 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results of these studies will be 
presented systematically in a later edition of this database, 
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but we include here a useful series of 22 orieming identifi- 
cations as an aid to other users of the rat liver pattern (Table 
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32 Carbamvlsied charge standards, computed p/s s „d 
molecular mass standardization 

We have previously shown that the use of a svstem of dose- 
ly-spaced internal p/ markers (made by carbamvlat ng a 
basic protein) offers an accurate and workable sc'lu on to 
the problem of assigning positions in the p/ dimension"'? 
The same system based on 56 protein species made b • ca"r- 
bamylat.ng rabbit muscle CPK, has been usee here to - 
sign p/ s to most rat liver acidic and neutral proieins The 
standards were coelectrophoresed with total liver pro e ins 
and the standard spots added to a special version of the 
master pattern F344MST3. The gel ^-coordinates of all 
liver protein spots lying wuhin the CPK charge train were 
then transformed into CPK p/ positions bv interpolation 
between the positions of immediately adjacent standards 
(Table 1) using a Kepler* vector procedure. 

It has proven possible to compute fairly accurate pi values 
for many proieins from the amino acid composition [421 
We have attempted here to test a further elaboration of his 
approach, in which we computed p/s for the CPK standards 
themselves, based on our knowledge of the rabbit muscle 
CPK sequence and the fact that adjacent members of the 
charge tram typically d.ffer by blockage of one additions Vv- 
sine residue (Table 3). We compared these v Sju « lc ™ £■ 
computed p/s for an additional set of carbarn vlaied'sSn" 
ards made from human hemoglobin beta chains and a se 
nes of rat liver and human plasma proteins of known posi- 
tion and sequence (Fig. 7,Table 4).The result demonstrate 
good concordance between these systems. Two proteins 

S^l lg !!f 1Ca x^ i0nS:]iverfa,,y - acid Ending protean 
(FABP; #1 ,„ Table 4) and protein disulphide isome a« 
(*20 in the table). The FABP spot present on F344™ST3 
may represent a charge-modified version of a more basic 

lEM&Vt 0 ™ ,0 ,! he , ex P ec ' cd PA ™t "solved in the 
1EF/SDS gel. Of particular importance is the fact that bv 
comparing computed p/s of sequenced but unlocated pro- 
teins with the CPK p/s, we can assign a probable gel "loci 
non without makjng any assumptions regarding the actual 
gel pH gradient. This offers a useful shortcut, given the va- 
garies of pH measurement on small diameter IEF gels We 
have used this approach to compute the CPK p/s of all rat 
and mouse proteins in the PIR sequence database, as an aid 
lo protein identification (data not shown). 



Coordinate, fl is 51 l.S3.fc is -0.2731 and cis 33183801 Th, 

JSSVh m i at has the efTect of removin * *ok«l o1 

from the gut-hve, -recirculation); and cholesterol itself The 
firs two agents should lower available cholesterol and he 
third should raise ,t, allowing manipulation of recant 
gene expression control systems in both directions. SuA 
.n experiment offers an interesting test of the 2-D mapping 
>s em since most of the pathway enzymes are present 2 
ow abundance, many are membrane-bound and diS 

el 0 U oi ' c 2 ro. a e nd pathWa> ' itSC!f iS complex. AppJS 
te^ 1000 proteins were separated and detected in liver ho- 

bv a g M ateS - Twemy - 0ne Pr0,eins *™ to be affected 

Ever.?? 1 0n V rea T enI - a " d «hese could be divid d £to 
several coregulated groups. 

3.3.1 MSN 413 (putative cvtosolic HMG-CoA svmhase) 
and sets of spots regulated coordinate!* or inversely 



In order to standardize SDS molecular weight (SDS-MW) 
we have used a standard curve fitted to a series of identified 
proteins (Fig. 8). Rather than using molecular mass perse 
we have elected to use the number of amino acids in the 
polypeptide chain, as perhaps a better indication of the 
length of the SDS-coated rod that is sieved by the second 
dimension slab. The resulting values were multiplied bv 
112 (the weighted average mass of amino acids in se- 
quenced proteins) to give predicted molecular masses Be- 
cause we use gradient slabs, we have not constrained the fit- 
ted curve to conform to any predetermined model- rather 
we tried many equations and selected the best usine the 

equation chosen was v 
— 0 + DX+ c/jt, where .vis the numberof residues,* is the gel 



solicHMr r ■ f lS (, " c,udin e a sP°t assigned to the cyto- 
sohcHMG-CoAsynthase.MSNMnishowed the expected 
ncrease m abundance with lovastatin or chole«mS„ e 

n n e / glS H C ? nh " inCreaSe with lovasl atin and crS 

S Soot' n n u m a h e ra Tn " * i,h lhe hi * h cho >« 

Tein in ,h number413 15 th e most strongly regulated pro- 
tern .n the present experiment, showing a 5 : to 10-fold in- 

f % 'ho.' , Cra 1 Week ' lrealmen ' With °- 075 % 'ovasta S ai d 
in c f h ° esI >" ram,ne the diet (Figs. 9 and 10). Its expres- 
-on follows precisely the expectation for an enzyme whose 
abundance ,s controlled by the cholesterol level- iUs „r 0 . 
gressiveiy increased from the control levels by cho est?™ 
mine, lovastaun and lovastatin plus cholestyramine and ft 
sinks below the threshold of detection in animals fed "the 
high cholesterol diet. This spot has been ternary Semf- 
fied as the cytosolic HMG-CoA synthase, based In a reac- 
tior , wuh an antiserum to that protein provided by Dr Mi 

ator e?^ n : Pan " ^ ^ & Dohme Research Labo. 
ratones.This enzyme lies immediatelv before HMG-CoA 
reductase m the liver cholesterol biosynthesi pathway and 
k , nown to be co-regulated with it. Spot 413 has^n SDS 
molecular weight of about 54 000 and J .CPK p/of - 11 4 in 
reasonably close agreement with a molecular we sht of 

ouen ceo? if? ^ f '° m ^ ~ 

quence of the hamster enzyme |43]. 

Using a classical product-moment correlation test (Kepler 

found to be coregulated with 4 13. The level of correlati n 
was exceedingly high (> 95%). Two of these, 1250 and 933 
are a. similar m lecu ,„ wejghls >nd . 

charge more acdic than 413 (Fig. 9), indicating that they 
may be covalently modified forms of the 413 polvpep tide 
This suspicion is strengthened by the observation ,'hat both' 

CoAsvmh 3 S° ^ ,ned b " V thC ami00dy 10 ^o^lic HMG- 
CoA synthase.The remam.ng three correlated spots appear 
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to comprise an additional related pair (1253 and 1003) of 
around 40 kDa and a single spot (1 3 39) of around 2f kDa. 
Because these two presumed proteins are present at sub- 
stantially lower abundances than 413,and because the cyto- 
solic HMG-CoA svnthase is reported to consist of only one 
tvpe of polypeptide, they are likely to represent other, very 
tightly coregulated enzymes. A second group of six spots 
wa< selected based on a regulatory pattern close to the in- 
verse of that for spot 413 (MSN's 34,79, 178, 182. 204.347; 
data not shown). For these proteins, the lowest level of ex- 
pression occurs with exposure to lovastatin plus cholestyra- 
mine and the highest level upon exposure to the high-cho- 
lesterol diet. Spots 182 and 79 are highly ccnelatec ana lie 
about one charge apart at the same molecular weight; they 
may thus be isoforms of a single protein. The other four 
spots probably represent additional enzymes or subunits. 

3 3.2 MSN 235 and coregulated spots 

A third group of five spots, mainly comprised of mitochon- 
drial proteins including putative mitochondrial HMG- 
CoA svnthase spots, showed a modest induction by lovasta- 
tin alone, but little or no effect with any of the other treat- 
ments (including the combination of lovastatm and choles- 
tvramine; Fig. 12). This result is intriguing because lovasta- 
tin was expected to afTect only the regulation of enzymes of 
cholesterol svnthesis, which is entirely extra-mitochon- 
drial. Three of the spots (235, 134, 144) form a closely- 
packed triad at approximately 30 kDa, and are likely to re- 
present isoforms of one protein. All three spots are stained 
by an antibody to the mitochondria] form of HMG-CoA 
synthase obtained from Dr. Greenspan. Subcellular fractio- 
nation indicates a mitochondrial location. The other two 
spots (633 at about 38 kDa and 724 at about 69 kDa) are 
each present at lower abundance than the members of the 
triad. 



proteins of the putative mitochondrial pathway are so 
much more variable in their expression in all groups. An ex- 
amination of all the coregulated groups suggests that quan- 
titative statistical techniques can extract a wealth of inter- 
esting information from large sets of reproducible gels.The 
abundance of spots in the 413 coregulation group, for exam- 
pie, shows an amazing level of concordance in their relative 
expression among the five individuals of the lovastatin and 
cholestyramine treatment group. This effect is not due to 
differences in total protein loading, since they have already 
been removed by scaling, and since proteins with quite dif- 
ferent regulation patterns can be demonstrated (e.g., Fig. 
13).Such effects raise the possibility that manygene coregu- 
lation sets may be revealed through the study of a suffi- 
ciently large population of control animals (/.e\, without 
any experimental manipulation). This approach, exploiting 
natural biological variation in protein expression instead of 
drug effects, offers an important incentive for the construc- 
tion of a large library of control animal patterns. 



4 Conclusions 

Because of the widespread use of rat liver in both basic bio- 
chemistry' and in toxicology, there is a long-term need for a 
comprehensive database of liver proteins. The rat liver mas- 
ter pattern presented here has proven to be an accurate re- 
presentation of this system, having been matched to more 
than 700 gels to date. As the number of proteins identified 
and the number of compounds tested for gene expression 
effects grows, we expect this database to contribute valu- 
able insights into gene regulation. Its practical utility in sev- 
eral areas of mechanistic toxicology is already being de- 
monstrated. 

Received September 11, 1991 



3.33 An example of an anti-synergistic effect 

A sixth spot (367) shows strong induction by lovastatin 
(two- to threefold), and about half as much induction with 
lovastatin plus cholestyramine, but without sharing the ani- 
mal-animal heterogeneity pattern of the 235-set (Fig. 13). 
This protein is also mitochondrial, and represents the clear- 
est example of an anti-synergistic efTect of lovastatin and 
cholestyramine. The existence of such an effect demon- 
strates that lovastatin and cholestyramine do not act exclu- 
sively through the same regulatory pathway. 

33.4 Complexity of the cholesterol synthesis pathway- 
Taken together, these results suggest that treatment with lo- 
vastatin alone can affect both cytosolic and mitochondrial 
pathways using HMG-CoA, while cholestyramine, on the 
other hand, either alone or in combination with lovastatin, 
produces a strong effect on the putative cytosolic pathway, 
but little or no effect on the putative mitochondrial path- 
way. An explanation for this difference may lie in lovasta- 
tin's effect on levels of HMG-CoA and related precursor 
compounds that are exchanged between the cytosol and 
the mitochondrion, whereas cholestyramine should affect 
only the cytosolic pathways directly controlled by cholester- 
ol and bile acid levels. It remains to be explained why some 
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6 Addendum 1: Figures 1-13 
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Figure J. Synthetic representation of the standard rat liver 2-D master pattern, rendered as a greyscale image using a videoprinter. 
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Figure}. Schematic representation of the master pattern (the same as Fig. 1), useful as an aid in relating specific areas ofFig. 1 and ihe following detailed 
quadrants. 
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Figuret. Plot of number of amino acids versus gel K-posiiion, with fitted 
curve used to predict molecular mass of unidentified proteins. 
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•4 Figure 7. (a) Plot of computed isoelectric point versus gel /-position for 
two sets of carbamylated standard proteins (rabbit muscle CPK (+} ind 
human hemoglobin 0 chain, filled diamonds) and several other proteins 
(shaded squares), (b) The identities of the various proteins represented 
by the squares are indicated by the numbers in corresponding positions 
on (a); these refer to Table 4. 
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fi^u/f P. Montage showing effects in the 
region of MSN :4 13. The montage shows a 
small window into one portion of the 2-D 
pattern, one row of windows for each expe- 
rimental group, and one panel for each gel 
in the experiment. The left-most pattern 
in each row is a group-specific copy of the 
master pattern followed by the patterns 
for the five individual rats in the group. 
The highlighted protein spots (filled circ- 
les) are spot 413 (on the right of each pan- 
el; identified as cytosolic HMG-CoA syn- 
thase) and two modified forms of it (1250 
and 933). From the top, the rows (experi- 
mental groups) are: high cholesterol, con- 
trols, cholestyramine, lovastatin, and lova- 
statin plus cholestyramine. 
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Figure JO, Bargraph showing the quantita- 
tive effects of various treatments on the 
abundance of MSN:413 (cytosolic HMG- 
CoA synthase) in the gels of Fig. 9. 
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Figure J J. Bargraphs of a series ofsix core- 
gulated spots including MSN:413. In the 
bargraphs. the abundances of the appro- 
priate spot (master spot number shown at 
the top of the panel) in each animal are 
shown. The five five-animal groups are in 
the order (left to right): high cholesterol, 
controls, cholestyramine, lovastatin, and 
lovastatin plus cholestyramine. Each bar 
within a group represents one experimen- 
tal animal liver(one 2-D gel).Note the cor- 
related expression of the 6 spots, espe- 
cially in the two far right (most strongly in- 
duced) groups. 
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Figure 13. Data on spot MSN:367, presented as in Fig. 11. This proiein 
shows unambiguously the anti-synergistic efTeci of lovasialin and choles- 
tyramine (fifth group) as compared to lovasialin (fourth group). This res- 
ponse contrasts strongly with the regulation pattern seen in Fig. 11. 
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Table 1. Master able of proteins in the rat liver database" 



Daubuc or rat liver proteins 
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MSN 



Y CFK0I SOSMW 



MSN 



Y CFKol SDSMW 



3 
5 
8 

11 
15 
17 
16 
19 
20 
21 
22 
23 
24 
25 
27 
28 
29 
30 
32 
-33 
34 
35 
36 
38 
39 
41 
42 
43 
44 
46 
47 
46 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
65 
66 
67 
68 
69 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
69 
90 
91 
92 
93 
94 



311 
568 

ei2 

549 
645 
629 
906 
755 
649 
1204 
332 
787 
313 
807 
1184 
1263 
743 
766 
1216 
1145 
1037 
6G3 
712 
763 
304 
1165 
664 
1318 
1924 
1203 
1391 
309 
605 
621 
1113 
1620 
725 
2001 
722 
678 
1662 
1091 
1171 
1400 
1853 
1888 
735 
1263 
1252 
779 
1064 
656 
636 
1562 
1570 
1264 
1336 
1633 
1767 
925 
534 
1811 
1412 
1471 
1662 
1596 
1817 
516 
1589 
1706 
651 
1415 
1773 
1338 
1706 



434 

263 
426 
266 
520 
589 
414 
298 
403 
448 
434 
424 
417 
516 
524 
446 
605 
112 
417 
445 
555 
412 
606 
694 
470 
569 
607 
589 
362 
566 
447 
454 
567 
535 
522 
499 
177 
500 
630 
533 
302 
580 
585 
624 
506 
567 
297 
312 
407 
692 
296 
589 
545 
583 
556 
621 
564 
363 
565 
736 
698 
363 
661 
347 
563 
479 
301 
1371 
696 
719 
329 
710 
545 
446 
696 



«-35.0 
-24.3 
-16.0 
•25.2 
-15.3 
-21.6 
-14.0 
-17.5 
-20.9 
-8.7 

<-35.0 
•16.6 

<-35.0 
-16.1 
•9.0 
-8.0 
-17.8 
-17.2 
-6.6 
-9.5 
-11.3 
-14.9 
-18.7 

-17.3 

<-35.0 
•9.2 

-19.6 
-7.3 
-0.1 
-8.7 
-6.3 

<-35.0 

-22.5 

-21.8 

-10.0 
-0.9 

-18.3 

>0.0 

-16.4 

-19.8 
•2.5 

-10.3 
-9.2 
-6.2 
-0.6 
-0.4 

-18.1 
4.0 

-6.1 

•16.8 

-10.8 

•20.6 

-21.2 

-3.6 

•3.8 

-6.0 

•7.0 

-0.8 

-1.5 
-13.6 
-26.1 

-1.0 

-6.0 

-5.0 

-2.7 

-3.4 

-0.9 
-27.0 

-3.5 

-2.2 
-20.6 

-6.0 

-1.4 

-7.0 

•2.2 



63,800 
102.900 
64.600 
101.000 
5E.200 
50.000 
66.300 
90.200 
67.900 
62,100 
63.800 
65.000 
66.000 
55.500 
54,900 
62.400 
49.000 

346.600 
66,000 
62.500 
52.400 
66.600 
48,900 
43,800 
59,800 
51,400 
4£,600 
50.000 
74,600 
50.200 
62.300 
61,500 
50,100 
53.900 
55,000 
57,000 

170.600 
56.900 

37.300 
54.100 

89.000 

50.600 

50.300 

47.800 

56.200 

51.500 

90,500 

85.900 

67,300 

43.900 

90,800 

50.000 

53,100 

50,400 

52.300 

4£.000 

51,600 

74.400 

51.700 

41.600 

43.600 

74,500 

44,500 

77.500 
51,800 
58,900 
89,100 
17.400 
43.600 
42,500 
81.700 
43.000 
53.200 
62.300 
43.700 



95 
96 
97 
98 
99 
100 
101 
102 
103 
104 
105 
106 
107 
106 
109 
110 
111 
113 
114 
115 
116 
117 
118 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
146 
149 
150 
151 
152 
153 
154 
155 
156 
157 
156 
159 
160 
161 
162 
164 
166 
167 
168 
169 
170 
171 
172 
173 



1119 
1731 
1033 
1406 
£76 
2004 
1106 
462 
665 
773 
312 
1769 
1585 
16G2 
1462 
778 
1728 
1191 
1298 
662 
1146 
1546 
1050 
1530 
836 
1572 
23 
621 
1296 
672 
1000 
1229 
1422 
1776 
1930 
660 
666 
1271 
1161 
453 
1858 
1504 
1468 
1689 
311 
1366 
1429 
615 
2006 
2006 
1070 
1347 
541 
1645 
1269 
1507 
1722 
932 
1031 
1970 
1258 
1275 
1663 
1034 
1953 
1020 
1566 
1905 
1340 
1506 
1338 
1969 
800 
476 
919 



536 
756 
566 
565 
1149 
538 
623 
455 
630 
1162 
1117 
509 
720 
807 
593 
516 
700 
660 
185 
907 
610 
849 
577 
628 
423 
712 
1433 
1474 
862 
921 
717 
311 
832 
499 
757 
537 
1019 
862 
1389 
1063 
823 
697 
707 
756 
1417 
915 
346 
1017 
566 
516 
1106 
578 
1461 
760 
236 
911 
446 
503 
294 
6S4 
183 
417 
820 
527 
771 
1462 
606 
565 
181 
563 
678 
541 
378 
956 
1314 



-9.9 
-2.0 
-11.4 
-6.1 
-23.8 
>0.0 
-10.1 
•26.5 
-20.2 
-17.0 
<-35.0 
-1.5 
-3.6 
-2.4 
-4.8 
-16.9 
-2.0 
^6.9 
-7.5 
-19.6 
-9.5 
•4.1 
•11.1 
-4.3 
-154 
•3.6 
<-35.0 
-21.9 
•7.5 
-14.7 
-12.0 
-8.4 
-5.8 
-1.4 
-0.1 
•20.4 
-20.2 
-7.9 
-9.3 
-29.7 
•0.6 
-4.6 
-4.8 
-2.4 
<-35.0 
-6.7 
-5.7 
-22.1 
>0.0 
>0.0 
-10.7 
-6.9 
-25.7 
-2.8 
-7.9 
-4.5 
-2.1 
-13.5 
-11.4 
>0.0 
-8.1 
-7.6 
-2.6 
-11.4 
>0.0 
•11.6 
-3.8 
-0.2 
-7.0 
-4.6 
•7.0 
>0.0 
-16.3 
-26.7 
-13.7 



53,800 
40.700 
51.600 
51,700 
25.000 
53.700 
47.900 
61.300 
37,300 
23.800 
26.100 
56.100 
42.500 
38,300 
49.700 
55.500 
43,500 
44.500 
160.600 
34.100 
46.700 
36.500 
50,600 
37.400 
65.200 
42.900 
15.300 
13.90C 
36.00C 
33,500 
42.600 
86.ia 
37,3a 
57.CXX 
40,7a 

53 .ea 
29.7a 

36.0a 

16,8a 

26.1 OC 

377a 
43.7a 

43, 20C 

40,7a 
15,8a 
33,8a 
77,9a 

29,800 

51,6a 
55,3a 

26.5a 
50.800 

137a 
40,sa 
117.0a 
33.9a 
62,1 a 

56.6a 

91.4a 

44.400 

i62.4a 

65,9a 

37,sa 
54,6a 
40,0a 
13,7a 

38,400 

51,7a 

164,9a 
50,400 

44.7a 

53,500 

7i.8a 
32.1a 
19.3a 



MSN 



Y CPKOI SOSMW 



174 

175 



1364 
825 



177 1582 

178 1321 

179 1069 

180 1666 



181 
182 



184 1860 

185 1997 
186 
187 
186 
191 



163 
393 

553 
710 
615 
567 
295 
730 
896 
1017 
279 1113 



411 
804 



773 
1538 
1560 



192 1616 

193 1469 



194 

195 



1380 
764 



196 1227 

197 667 
196 2006 
199 1711 



2a 
201 
202 

203 



204 1 224 

205 439 

206 1994 

207 1 895 
208 
210 
211 
213 



296 
807 
674 
667 
555 
266 
632 
1185 
553 
681 
674 
424 
435 
253 
829 
589 
983 
571 
687 

240 1418 



872 
292 
736 
786 



17a 
902 
1067 



214 1340 

215 1591 



216 
217 
218 
219 



220 1479 

221 965 
223 934 
225 1612 
226 
227 

228 1 065 

229 1577 

230 1458 



499 

517 
684 
668 
495 
755 
393 
572 
177 
911 
927 
716 
1045 
821 411 
1586 1463 



1565 
1159 
931 
713 



232 
234 
235 
236 
237 



1440 

1692 
616 
920 
952 



236 1 611 
239 1 469 



240 
241 
242 
243 
244 



501 
1820 
1357 

711 
1855 



245 1169 

246 551 



247 
246 
249 
250 
251 
252 
253 
254 



1346 
460 
1733 
1974 
806 
674 
753 
995 



255 1690 

256 994 



257 
256 



508 
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567 
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496 
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469 
1004 
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541 
720 
446 
569 
656 
1182 
621 
474 
459 
604 
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451 
788 
392 
553 
646 
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679 
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464 
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-6.7 
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-3.6 
•7.2 

-10.4 
-0.5 

-32.1 
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-0.6 

>0.0 
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-0.9 
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-8.4 
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-2.2 

-14.7 
c-35.0 

-18.0 

-16.7 

-6.5 
•30.9 

>0.0 

•0.3 
<-35.0 

-2.3 
-14.1 
-10.4 

-7.0 

-3.5 

•3.6 

-9.3 
-13.5 
•18.7 

^.9 
•12.8 
-13.5 

-1.0 
-15.8 

-3.6 
-10.8 

•3.7 

-5.2 

-5.5 

-2.4 
-22.0 
-13.7 
-13.1 

-3.2 
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-29.3 

-1.9 
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-44 



162,900 
69.300 
52.600 
43,000 
48.300 
51.600 
91.200 
42,000 
34,5a 
29.8a 
26.3a 
90.8a 
38.400 

44,9a 
44.2a 

52.400 

101.6a 
47,3a 

23.7a 
52.6a 

44.5a 
44.9a 

65.0a 
63,7a 

io7.ea 

37,400 

50.0a 
31.1a 
51.3a 
44.2a 
15.8a 
57.0a 

55,400 
44,400 

45.2a 
57,3a 
40.7a 

69.3a 

51.2a 

170.5a 
33.9a 

33.3a 
42,7a 

28.8a 
66.8a 

13.6a 
51.6a 

34,8a 

57.3a 

36.5a 

57.9a 
30.3a 

25,400 

30.2a 
53.5a 
42.5a 
62,1 a 

51,400 

45,8a 

23.8a 

46.oa 
59.3a 
6i. oa 
49.1a 
62,1 a 
61. ea 
39,2a 
69,5a 

52.500 
36.500 

61.9a 

44.6a 

30.2a 

60.400 
37.8a 



a) Master table of proteins in the rat liver database, showing spot master number, gel position U and,), isoelectric point relative to CPK standards and 
predicted molecular mass (from the standard curve of Fig. 8). »««ua/u»,«nu 
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Table 3. Computed pT$ of two sets of carbamylaied prouin standards: Rabbit muscle CPK and human 
hemoglobin (Hb) 
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FIR #ASP #GLU #HIS 
Name 3.9 4.1 6.0 



tlYS *ARG NH2- Cai c Real 
10.8 12.5 7.0 pi CPK 
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High Specific Activity Chemiluminescent and 
Fluorescent Markers: their Potential 
Application to High Sensitivity and 
'Multi-analyte' Immunoassays 



Roger Ekins*, Frederick Chu and Jacob Micallef 

Department of Molecular Endocrinology, University College and Middlesex School of Medicine 
University of London, Mortimer Street, London W1N 8AA, UK 



The sensitivrt.es of immunoassays relying on conventional radioisotopic labels (i e 
radio.mmunoassay (RIA) and immunoradiometric assay (IRMA)} permit the measurement of 
analyte concentrations above ca 10 7 molecules/ml. This limitation primarily derives in the 
case of 'competitive' or 'limited reagent' assays, from the 'manipulation errors arising in the 
system combined with the physicochemical characteristics of the particular antibody used- 
however, in the case of 'non-competitive' systems, the specific activity of the label may play a 
more important constraining role. It is theoretically demonstrable that the development of 
assay techniques yielding detection limits significantly lower than 10 7 molecules/ml depends 



on: 



(1) the adoption of 'non-compethive' assays designs; 

(2) the use of labels of higher specific activity than radioisotopes- 

(3) highly efficient discrimination between the products of the immunological reactions 
involved. 

Chemiluminescent and fluorescent substances are capable of yielding higher specific activities 
than commonly used radioisotopes when used as direct reagent labels in this context and both 
thus provide a basis for the development of 'ultra-sensitive*, non-competitive, immunoassay 
methodologies. Enzymes catalysing chemiluminescent reactions or yielding fluorescent 
reaction products can likewise be used as labels yielding high effective specific activities and 
hence enhanced assay sensitivities. 

A particular advantage of fluorescent labels (albeit one not necessarily confined to them) lies 
in the possibility they offer of revealing immunological reactions localized in microspots' 
distributed on an inert solid support. This opens the way to the development of an entirely new 
generation of 'ambient analyte' microspot immunoassays permitting the simultaneous 
measurement of tens or even hundreds of different analytes in the same small sample using 
(for example) laser scanning techniques. Early experience suggests that microspot assays with 
sensitivities surpassing that of isotopically based methodologies can readily be developed. 

Keywords: Ultrasensitive immunoassay; fluorescent microspot immunoassay; confocal 
microscopy ' 
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INTRODUCTION 

Immunoassay methods relying on radioisotopic 
labels have played a major role in medicine and 
other biologically related fields (agriculture 
veterinary science, the food and pharmaceutical 
industries, etc.) during the past two decades 
Their importance has derived from the exploita- 
tion both of the 'structural specificity' characteriz- 
ing antibody-antigen reactions and the 'detecta- 
bility of isotopically-labelled reagents, the latter 
permitting observation of the binding reactions 
between exceedingly small concentrations of the 
key reactants involved. The combination of these 

ZT* h i S end ° Wed rad ''oimmunoassaJ 
methods with unique specificity and sensitivity 
characteristics, and accounts for their ubiquitous 
use throughout modern medicine and biology 
However, in the past few years, interest has 
increasingly focused on so-called 'alternative' 
non-radioisotopic, immunoassay methods- <uch 
techniques are based on essentially identical 
analytical principles but differ in the markers used 
to iabe the part.cular immunoreactant (antibody 
or analyte) whose distribution between bound 
and free moiet.es (following the basic analytical 
reaction) constitutes the assay 'response' ' The 

S°headi^ iS imereSl b£ gr ° U P ed und - 
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fundamental reason for their replacement sterm 

d P e": d o°n iCa,,y ' ?*, --"^e q u~t m t S o 
develop microanalytica! techniques which are 

superior to them in this particular respect 

Radioisotopic methods are, in practice, limited to 

l^ e ^S em ? f anal - vte concentrations above 
about l0«- 0 9 molecules/ml (i.e. approx 0.15-1 5 
pmo / KDakubu „*/.. 1984). Howler, in certain 
fields (e.g. virology, tumour detection) there is a 
particular need to detect or measure molecular 
concentrates below this level. The factors which 
determine immunoassay sensitivity have been 

Ekin n s S, 197 y 8 f? 56 ' (Ek , inS £t a/ " ,968 ' 197 °a; 

1984; Ekins, 1985). Nevertheless, some of the 

VASTUS COnce P! s . a , re sli » frequently misunder- 
stood and merit brief discussion in the present 



The concept of sensitivity 



(1) Environmental; logistic; economic; practical- 

m Sr a d , C ° nVenien ^ - (i e - '"^-scientific). 
til -r? atta,nmenl of h'gher sensitivity 

(3) The development of 'immunosensors' and 
immunoprobes'. 

^ s^ems eVe,0Pment ° f ' multi " anaI y te ' assay 

Our own reasons for developing non-isotopic 
echmques fall principally under headings (2) and 

(4) , and this presentation will centre primari y on 
the concepts which underlie our immunoassay 
development strategy in these areas * 



THE ATTAINMENT OF 'ULTRA-HIGH' 
IMMUNOASSAY SENSITIVITY 

Though, as indicated above, the sensitivity of 
radioisotop.cally based immunoassay methods 
has constituted one of the principal foundations 
of their widespread use over the past 25 years a 



One major source of past confusion has been 
ft 5 fZT^ reg3rding ,hC COnc W of 'sensitiv- 
w th ,hi dn 8ny f a equatin S assa >' sensitivity 
with the slope of the dose-response curve (Yalow 
and Berson, ]970a, b ; Berson and Yalow 1973 
see a so Ekms et al., 1970b, Tait, 1970). It is now 
widely agreed that the notion that a steep™ 
dose-response curve implies greater sensitivity is 
erroneous. The invalidity of this belief fs deilv 
revealed by the fact that the relative magnitude 
of the responses yielded by two assay systems is 
dependent on the particular variable Ihich is 
5?° s f", , ° ^Present the response (see Fie 
Ka) (Ekms, 1976). For this and other reasons it 
has long been recognized that the 'sensitivity' of 
an assay can only be satisfactorily represented by 
its lower hm.t of detection (Fig. 1(b)), an d this 

ZTSIV™ emb ° died in 3,1 i"ternationX 
agreed definitions of the term. An essentially 

InH "J defin,tio " is a * 'he precision (i I 
standard dev.at.on) of measurement of zero dose 
since this quantity determines the least quantity 
distingu.shable from zero and hence the assay 
detection limit. The sensitivity of an assay is thus 
represented by the zero-dose intercept of the 
precision profile' (Fig. 2(a)) when the latter is 

Z^f ,CrmS f S,andard deviatio " rather 
than of coefficient of variation (Ekins, 1983a) In 
short, the more sensitive of two assays is the one 
yie ding greater precision of the zero dose 
estimate (Fig. 2(b)). 
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F/E plot 



B/F plot 



Any plot 




R 3 : 




(r*sponi«) 



a. 



b. 



Figure 1. (a) Diagrammatic representation of conventional RIA dose-response curves for systems using high (hi) and low (lo) 
antibody concentrations plotted in terms of free-bound (F/E) and bound/free (B/F) labelled antigen. Note that the use of a lower 
amount of antibody yields a dose-response curve of greater slope in the F/B plot, but of lower slope in the B/F plot. It is 
impossible to decide, on the basis of the date shown in this figure, which concentration of antibody yields the assay system of 
higher sensitivity, (b) The sensitivity of an assay is essentially represented by the minimum detectable dose. i.e. the SD of the 
dose measurement (SD {op5e) ) at zero dose. This is given by the SD of the response (SD^^e)) divided by the dose-response 
curve slope at zero dose (i.e. ((SD( response ,) x dD/dtf) 0 ). This quantity is unaffected by the choice of the coordinate frame used to 
plot the dose-response curve. (Note: it is common to multiply (SD (ao5eJ ) 0 by an arbitrary factor to increase the confidence level 
attaching to the minimum detectable dose estimate, though, since no agreement exists regarding the value of this factor, this 
unnecessary step merely adds to confusion when the relative sensitivities of two assay procedures are compared.) 



'Competitive' and 'non-competitive' (limited 
reagent' and 'excess reagent') assays 

A second important misconception in this area is 
the notion that immunoassays relying on the use 
of labelled antibodies (e.g. immunoradiometric 
assays, IRMA) are ipso facto more sensitive than 



those which rely on the use of labelled 'analyte' 
(e.g. radioimmunoassays, RIA); furthermore the 
grounds originally advanced for the claimed 
superiority of labelled antibody methods (Miles 
and Hales, 1968) were partially based on false 
concepts of sensitivity, and thus failed to identify 
the true reasons why certain assay designs are 




(a) (b) 



Figure 2. (a) The 'precision profile' of an assay portrays the error in the dose measurement as a function of dose. The error 
may be represented, inter alia, by the absolute error (AD; e.g. SD of D) or the relative error (A0/D; e.g. CV of D). (A0) o . the 
error in the measurement of zero dose, represents the sensitivity of the assay. The working range may be defined as the range 
of dose values within which AD/D is less than an 'acceptable' value set by the inv stigator. (b) The more sensitive of the two 
assays (assay I) intercepts the AD axis at a lower value. However, assay II is more precise at higher values of dose, and has a 
wider working range. 
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potentially capable of yielding far higher sensitiv- 
ity than others. This issue likewise merits 
clarification. 

The pureJy pragmatic sub-classification of 
immunoassays into labelled antibody and labelled 
analyte methods diverts attention from a more 
fundamental divide in immunoassay methodolo- 
gy, which relates to the optimal concentration of 
antibody required in an assay system to maximize 
its sensitivity. In certain assay designs (which may 
be termed 'limited reagent' or 'competitive') the 
optimal concentration tends to zero; conversely in 
others (which may be termed 'excess reagent' or 
'non-competitive') the concentration tends to 
infinity. Jt should be particularly emphasized that 
the optimal antibody concentration is essentiaUy 
governed, not only by the physicochemical charac- 
teristics of the antibody-analyte binding reaction, 
but also by the errors incurred in measurement of 
the assay response. Were an assay system to be 
totally error-free, no antibody concentration 
would be optimal, and the distinction between 
competitive and non-competitive methodologies 
would thus not arise. 

Though it is inappropriate in this presentation 
to discuss in detail the statistical and physico- 
chemical theory underlying this fundamental 
divergence in immunoassay design (see Ekins et 
al, 1968, 1970a; Jackson etal., 1983), the reason 
for it can perhaps be more readily understood if 
the basic principles of immunoassay are portrayed 
m a somewhat different way from that in which 
they are usually presented. All immunoassays 
essentially depend upon measurement of the 
'fractional occupancy' by analyte of antibody 
binding sites following reaction of analyte with 
antibody (see Fig. 3(a)). Those techniques which 
implicitly rely on measurement of residual 
unoccupied, binding sites optimally necessitate " 
the use of concentrations of antibody tending to 
zero, and may be termed 'competitive', converse- 
ly those in which occupied sites are directly 
measured necessitate use of high antibody con- 
centrations and are termed 'non-competitive' 
(Fig. 3(b)). This emphasizes that the differences 
in assay design characterizing so-called competi- 
tive and non-competitive methods are essentially 
unrelated to which component (if any) of the 
reaction system is labelled. Indeed immunoassays 
in which no label of any kind is involved can, on 
identical grounds, be subdivided into those of 
'limited reagent' (or 'competitive') and 'excess 
reagent' (or 'non-competititve') design. Thus the 



distinction between these two forms of im- 
munoassay simply reflects differences in the way 
that fractional antibody occupancy is determined 
and the fact that it is generally undesirable— for 
reasons of accuracy— to measure a small quantity 
by estimating the difference between two large 
quantities. When an immunoassay relies on the 
measurement of unoccupied antibody binding 
sites, the total amount of antibody used in the 
system must be small to minimize error in the 
resulting (indirect) estimate of occupied sites. 

Measurement of occupied sites 
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Figure 3. The dist.nct.on between non-competitive' (above) 
and competitive immunoassays (below) reflects how 
antibody b,nd,ng-s,te occupancy is measured Labelled 
antibody methods are non-competitive" if occupied Ss rt 
the (labe.led) antibody are measured, but are "comS ve ! 
(below nght) when unoccupied sites are measured iTbeMed 
antigen (below left) or labelled anti-idiotypfc antibod! 
^° dS h^ 0W Cemre ' rely 0n measurement Tt° l 
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"Competitive* immunoassay % Non-competitive" immunoassay 




Figure 4. Curves showing the theoretically predicted relationship between antibody affinity and the sensitivities achievable 
using 'competitive' and 'non-competitive' essay strategies. The 'potential' sensitivity curves assume the use of infinite specific 
activity labels; the sensitivities achievable using 12£ l-labelled antigen or antibody are also shown. Shaded areas indicate the 
sensitivity less due to errors in measurement of the label. Curves relating to 'competitive' assays assume a 1% error in 
measurement of the response variable arising from 'experimental' errors (i.e. errors other than those inherent in label 
measurement per se). Non-competitive curves assume 'non-specific binding* of labelled antibody of 0.01 % and 1 % (lower and 
upper curves) respectively. Arrows indicate sensitivities claimed for typical noncompetitive immunoassay methodologies. 



Conversely, when occupied sites are measured 
directly, this particular constraint does not arise; 
indeed, considerable advantage often derives 
from using relatively large amounts of antibody in 
the system. 



Sensitivity of 'competitive' and 
'non-competitive' immunoassays 

Competitive and non-competitive immunoassays 
differ significantly in many of their performance 
characteristics in consequence of the differences 
in optimal antibody concentration on which they 
rely. Most particularly they differ in their 
potential sensitivities. Figure 4. portrays the 
sensitivities predicted theoretically as a function 
of antibody binding affinity, making realistic 
assumptions regarding the experimental errors 
incurred in reagent manipulation, 'non-specific' 
binding of labelled antibody, etc., and assuming 
the use of optimal reagent concentrations (Ekins, 
1985). Amongst other concepts illustrated in the 
figure is the much greater assay sensitivity 
potentially attainable (using an antibody of given 
affinity) by adoption of a non-competitive 
approach. In short, whereas the maximal sensitiv- 



ity realistically achievable using a competitive 
design is in the order of 10 7 molecules/ml (using 
antibody of the highest affinity found in practice), 
a non-competitive method is capable of yielding 
sensitivities some orders of magnitude greater 
than this. However, Fig. 4 also demonstrates that, 
assuming the use of high affinity antibodies (i.e. 
-I0 n -I0 12 l/M), maximal sensitivities yielded by 
isotopically based techniques (whether relying on 
labelled antibody (IRMA) or labelled analyte 
(R1A), or whether of competitive or non- 
competitive design) are closely comparable, i.e. 
of the order of 10 7 -10 8 molecules/ml. 

This limitation is a manifestation of the fact 
that, in the case of the non-competitive methods, 
an important constraint on assay sensitivity is 
(under certain circumstances) the 'specific activ- 
ity' of the label used. On the other hand, 
limitation of assay sensitivity due to the low 
specific activity of radioisotopic labels does not 
often arise, in practice, in the case of competitive 
assays, whose sensitivity is generally restricted by 
other factors (Ekins, 1985). The fundamental 
significance of this conclusion is that, only by the 
use of labels possessing specific activities higher 
than those of the commonly used radioisotopes in 
assays of non-competitive design, can current 
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sensitivity limits be breached. Conversely use of 
a higher specific activity label in a competitive 
assay will usually have no significant effecf on its 
sensitivity (assuming experimental errors incur- 
red in reagent manipulation of the magnitude 
generally encountered in practice). 

High specific activity non-isotopic labels 

The term 'specific activity' i s conventionally 
applied, in the case of radioisotopic labels to 
denote the number of radioactive disintegrations 
per unit time per unit weight of the isotope or 
labelled compound. In the present context, use of 
the term is widened to signify 'detectable events' 
per unit time per unit weight of labelled material 
Thus it can be used to indicate the rate of photon 
emission by a chemiluminescent or fluorescent 
label, or the rate of conversion of substrate 
molecules-by an enzyme label-to molecules of 
a detectable product. The importance of the 
concept derives from the fact that 'sienal 
measurement error' (i.e. error in the measure- 
ment of the label per se) is a contributory factor in 
limiting assay sensitivity, and may— when other 
sensnivny-constraining factors are reduced- 
become dominant. Furthermore, when extending 
the sensmvit.es of immunoassay svstems bevond 
their present limits, the numbers of molecules 
involved are low, and statistical errors incurred in 
countmg indmdual 'detectable events', and the 
time require d to count them, may assume a 
particular importance. 

Table ] compares the specific activities of 
potentially useful labels with that of I25 l AH are 
of relevance in the context of this volume since 
chennluminescent and fluorescent labels can be 
used to label antibodies (or antigens) directly 
alternatively, enzyme labels catalysing reactions 
y.eld.ng chemiluminescent signals or fluorescent 
products can be utilized. 
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Teble 1. Relative specific sctivhi s f vari u* 
atopic and non-isctopic labels N t* thi *k I 
the specific activity f^^V^ d Ts 

JJIch ,e,y on the use 

Specific Activities 



125i 



Enzymes: 



Chemiluminescent 
labels 

Fluorescent labels: 



1 detectable event/sec/7.5 x to 6 
labelled molecules. 
1 detectable event/sec/5.6 x io 8 
labelled molecules. 
Determined by enzyme 'amplifica- 
tion factor' and detectability of 
reaction product. 

1 detectable event/labelled mole- 
cule. 

Many detectable events/labelled 
molecule. 



The importance of background in 
non-competitive immunoassays 

A second important factor governing the sensitiv- 
ity of non-competitive labelled-antibody im- 
munoassays is the 'background' or 'blank' signal 
emitted m the absence of analyte, since error in 
the measurement of this signal is clearly a major 
determinant of the error in measurement of zero 



• e , Am °ngft contributors to the background 

S3? Tmh he r SC ' °' lhe measuri "8 ''"^nnent 
itself, amb.ent signal generators (such as, in 

sandwich immunoassays, solid 'capture- 
amibodv supports or, in the case of radioisotopic 
methods, cosmic ray and other extraneous radia- 
>on sources) and 'non-specifically. bound' label- 
led antibody. Minimization of each of the* 
components is essential for maximal sensitivhy 
mere arithmetic subtraction of background is of 
absolutely no benefit in this context 
. Non-specific binding of antibody is of particular 
merest since the magnitude of this contribution 

ln?K P H en, -, imer on the amo «"t of labelled 
am.body used ,n the system, and the duration of 
•ts exposure to analyte. Thus increasing the 
amount of labelled antibody increases the amount 
of such antibody bound to analyte; however " 
may also .ncrease the non-specifically bound 

caus?a ,0 n P a I gre H ,er • Pr0pOr,iOna, exle nt/and thus 
cause a net reducnon m sensitivity. This effect 
undcrl.es the loss in sensitivity at higher antfbody 

tomZV™* d f PiC i Cd in Fi «- 5 (reproduced 
from Jackson et a!. , 1983). This phenomenon also 

underhes the relat.onship between sensitivity and 

the affmny constant of the labelled antibody 

deP H C, H d '! 4 - The P°«ession by labelled 
annbody of a high affinity constant implies that a 
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Antibody Affinity 




Labelled Antibody Concentration 

(moles/liter) 



Figure 5. Assay sensitivity (represented by the standard 
deviation of the zero dose measurement, o 0 ). plotted as a 
function of the concentration of labelled antibody (of affinity 
10 n L/M) used in the essay, assuming different levels of 
non-specific binding of labelled antibody "(Note: an irreducible 
instrument background has been assumed in the computa- 
tions represented; this limits the ultimate sensitivity attain- 
able, regardless of the concentration of antibody used.) 



lower concentration is required to yield the same 
level of analyte binding, albeit with reduced 
non-specific binding, thus increasing assay sensi- 
tivity 

In summary, the high sensitivity of non- 
competitive labelled antibody methods derives 
essentially from their permitted use of optimal 
concentrations of antibody which (provided non- 
specific binding of labelled antibody is low) 
are generally considerably greater than in com- 
petitive methods, not from the fact that the 
antibody is labelled. Labelled antibody methods 
generally fall in sensitivity as the concentration of 
antibody is reduced towards zero, ultimately 
yielding a sensitivity theoretically identical to that 
of competitive methods (Rodbard and Weiss, 
1973). (Paradoxically, early exponents of labelled 
antibody methods, whilst claiming them to be of 
higher sensitivity, also concluded that their 
sensitivity was increased by reduction in the 
amount of labelled antibody used (Woodhead et 
al., 1971). This incorrect conclusion — based on 
observation of effects on the slope of the 
dose-response curve — exemplifies the many falla- 
cies encountered in the immunoassay field stem- 
ming from confusion regarding the concept of 
sensitivity discussed above.) Finally it should be 



emphasized that maximization of the sensitivity of 
a non-competitive immunoassay generally implies 
the selection of reagent concentrations and other 
experimental conditions such that the [analvte 
signal/background] ratio (i.e. sib) is maximized. 
However, this simple relationship disregards 
statistical considerations which arise when the 
numbers of detectable events are very low, and a 
more appropriate objective may, under these 
circumstances, be maximization of the ratio s 2 Ib 
(Loevinger and Berman, 1951). 



Other performance characteristics of 
competitive and non-competitive 
immunoassays 

Non-competitive designs also display a number of 
other advantages deriving from the relatively high 
antibody concentrations on which they generally 
rely. These include increased reaction speeds 
(and hence shorter incubation times), decreased 
vulnerability to certain environmental effects 
(which cause variations in binding affinity be- 
tween antibody and analyte), reduced sensitivity- 
dependence on high antibody binding affinity, 
etc. 

Nevertheless a price has to be paid for these 
benefits; this includes the greater tendency of a 
large amount of antibody to bind molecules 
differing from, but with structural resemblance 
to, the analyte itself, implying a loss of assay 
specificity. This effect generally necessitates the 
use, whenever possible, of an 'immunoextraction' 
procedure using a second 'capture' antibody 
(usually directed against a different binding site, 
or 'epitope') as shown in Fig. 3(b). This 
technique — the 'sandwich' or 'two-site' im- 
munoassay (Wide, 1971)— thus potentially com- 
bines the twin virtues of ultra-high sensitivity and 
specificity (together with short reaction time), 
features of crucial importance in many diagnostic 
situations (for example, in the detection of AIDS 
viral antigens). (Note, however, that the loss of 
specificity inherent in non-competitive assay 
designs implies that they are less readily applic- 
able to the measurement of analytes of small 
molecular size, which cannot be simultaneously 
bound by two different antibodies directed 
against different antigenic sites on the molecule. 
Such analytes are generally more appropriately 
measured using 'competitive' assay methods.) 
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Development of ultra-sensitive 
immunoassay methodologies 

The perception that the development of 'ultra- 
sensitive' immunoassay systems (i.e. systems 
surpassing conventional RJA methods in sensitiv- 
ity) depends on (a) reliance on 'excess reagent' or 
*non-competitive' assay designs; (b) the use of 
non-isotopic labels displaying higher specific 
activities than commonly used radioisotopes: (c) 
the development of efficient separation svsYems 
(ensuring minimization of non-specific antibody 
binding, and hence of signal 'backgrounds'), and 
(d) dual or multi-antibody analyte-recognition 
systems (exemplified by 'sandwich' or two-site 
assays) to maintain/increase assay specificitv, has 
formed the basis of our own laboratory's im- 
munoassay development since the early to mid- 
1970s (Ekins, 1978). This led us, inter alia, to an 
immediate recognition (Ekins, 1979, 1980) of the 
importance of the in vitro techniques of mono- 
clonal antibody production pioneered bv Kohler 
and Milstein (1975), which are currently the 
subject of bitter patent disputes in the'USA 
(Ezzell, 1986, 1987a,b), and which may be 
expected in Europe. 

Meanwhile, of the candidate labels for use in 
this context, both chemiluminescent and fluores- 
cent labels offer many attractions. The develop- 
ment of stable, highly chemiluminescent, acridi- 
nium esters by McCapra and his colleagues 
(McCapra et al., 1977) has subsequently been 
exploited by Weeks et al (1983, 1984) and, more 
recently, by several commercial kit manufactur- 
ers; other workers have used more conventional 
chemiluminescent compounds to label immuno- 
assay reagents (see, for example, Kohen et al 
1984, 1985; Barnard etal., 1985). Yet others have 
relied on enzyme labels to catalvse chemilumi- 
nogenic (Whitehead etal., 1983) and fluorogenic 
(Shalev et al., 1980) reactions as indicated above. 
Detailed description of these various methodolo- 
gies is presented by others in this volume and 
need not be duplicated here. 

Common to all the 'ultra-sensitive' immuno- 
assay methodologies relying on such alternative 
labels is their dependence on a non-competitive, 
labelled antibody, assay strategy whenever 
appropriate; however, for the reasons indicated 
above, competitive methods continue to be 
generally employed for the measurement of 
analytes of small molecular size (e.g. therapeutic 
drugs, steroid and thyroid hormones, etc.). 



Nevertheless, the convenience (from a manufac- 
turing viewpoint, and for other technical reasons) 
of relying on standard labelling procedures has 
meant that, even in these cases, labelled antibody 
techniques are increasingly preferred/Though the 
commercial kits based on these various labels 
differ to a minor extent in sensitivity, specificity 
convenience, etc., such differences are at least 
partially attributable to differences in the physi- 
cochemical characteristics of the antibodies used 
m the kits, and to other 'immunological' factors 
unconnected with the particular nature of the 
label per se. 

Despite the obvious attractions of chemi- 
luminescent techniques in an immunoassay con- 
text the use of fluorescent labels combined with 
sophisticated time-resolution techniques for their 
detection (a concept arising from discussions with 

miH ,™ a ? " ] u 0) 3ppeared 10 us ( jn 

nnc k , } ? 0ff6r m ° re exci,in S 'ong-'erm 
possibilities for a number of reasons. These 
naturally included attainment of the enhanced 
specific activities and high signal to background 
ratios required for ultra-sensitive immunoassay as 
ind.cated above. However, more importantly 
fluorescence techniques also appeared to provide 
a simple route to the development of 'multi- 
analyte' assay systems of the kind described 
below. 

In pursuance of this strategy, we began 
collaboration with LKBAVallac, ca 1976-77 in 
the development of the instrumentation and 
technology required to develop such methods 
Fortunately a group of fluorescent substances 
generally known as the lanthanide chelates 
(including, m particular, the chelates of euro- 
pium, samarium and terbium facilitate such 
development, possessing prolonged fluorescence 

? CC S 'T S j~ ,0 - ,00 °ns), large Stokes shift 
(-300 nm) and other desirable physical character- 
istics which permit the construction of relatively 
cheap instrumentation for their measurement 
(Marshall etal., 1981 : Hemmila etal 1983) The 
fluorescent properties of the lanthanide chelates 
may be compared with those of a conventional 
tluorophor such as fluorescein which is characte- 
rized by a much smaller Stokes shift (-28 nm) 
and a fluorescent decay time and emission 
spectrum which imply that it is less readily 
distinguished from fluorescent substances present 

hnlH £ UCh 35 bilirubin) 0r in P lastic sample 
f k f" L he Un,que flu orescence characteristics 
of the lanthanide chelates thus permit them to be 
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measured in the presence of a fluorescence 
background (deriving from extraneous sources) 
which, in practice, approaches zero. Fig. 6 
illustrates the basic concepts involved in pulsed- 
light, time-resolved, fluorescence measurement, 
which form the basis of the DELF1A immunoas- 
say system currently marketed by LKEAVallac. 

Though it is inappropriate to pursue this 
subject in greater detail, attention should also be 
drawn to the possibilities offered by phase- 
resolved fluorimetry. This permits separate iden- 
tification of fluorophores differing in fluorescence 
lifetime by their exposure to a sinusoidally 
modulated exciting light source, and observation 
of their demodulated, phase-shifted, light emis- 
sion (McGown and Bright, 1984). This technique 
offers the possibility both of the development of 
homogeneous assays (relying on a difference in 
fluorescence decay time of bound and free forms 
of the fluorescent-labelled molecule), and of 
discriminating between two labelled antibodies in 
the context of multi-analyte 'ratiometric' im- 
munoassay as discussed below. 

'AMEIENT ANALYTE' IMMUNOASSAY 

Before proceeding to a discussion of the develop- 
ment of multi-analyte assays, another important 
concept, termed 'ambient analyte immunoassay' 
(Ekins, 1983b), must first be examined. This 
term is intended to describe a type of immuno- 
assay system which, unlike unconventional 



methods, measures the analyte concentration in 
the medium to which an antibody is exposed, 
being essentially independent both of sample 
volume, and of the amount of antibody present. 
This concept is illustrated in Fig. 7, and relies on 
the physicochemically-based proposition that, 
when a 'vanishingly small' amount of antibody 
(preferably, but not essentially, coupled to a solid 
support) is exposed to an analyte-containing 
medium, the resulting (fractional) occupancy of 
antibody binding sites solely reflects the ambient 
analyte concentration. Clearly the binding by 
antibody of analyte results in a depletion of the 
amount of analyte in the surrounding medium, 
but provided the proportion so bound is small 
(i.e. less than, for example, 1% of the total), such 
disturbance can be ignored. (This effect is closely 
analogous to that caused by the introduction of a 
thermometer into a medium possessing a much 
larger thermal capacity; the temperature disturb- 
ance caused by the thermometer itself is negligi- 
ble and can, in these circumstances, be disre- 
garded.) 

The principles of ambient analyte assay derive 
from the recognition that all immunoassays 
essentially depend upon measurement of the 
'fractional occupancy 1 by analyte of antibody 
binding sites following reaction of analyte with 
antibody as discussed above (Figs 3. (a) and (b)). 
The fractional occupancy of ('monospecific' or 
'monoclonal') antibody binding sites in the 
presence of varying analyte concentrations, plot- 
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Figure 6, Basic principles of pulse-light, time resolved 
fluorescence. Fluorescence emitted by the fluorophor (typi- 
cally a europium chelate) is distinguished from background 
fluorescence, which decays more rapidly. 
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Figure 7. Basic principle of 'ambient analyte' immunoassay 
(AAI). The fractional occupancy (F) of a vanishingly small 
amount of antibody {of affinity AO is determined by the analyte 
concentration in the medium ([An]). 
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ted against antibody concentration, is portrayed 
in Fig. 8. The fraction of analyte bound is also 
plotted in this figure. (Note: "for the sake of 
generality, all concentrations in this figure are 
expressed in terms of ]/A\ where K is the affinity 
constant of the antibody. For example, if K = 
10" L/M, a concentration of 0.1 x 1/K represents 
0.1 x 10" n M/L.or0.1 x J(T n x ](T 3 x 6 02 x 
10 23 = 6.02 x ]0 8 moJecuIes/ml.) 

It should be particularly noted that, at antibody 
concentrations of less than caO.Ol x 1/K antibody 
fractional occupancy is essentially dependent 
solely on the analyte concentration in the 
medium, and is independent of variations in 
antibody concentration. Thi: reflects the fact that 
this concentration of antibody binds less than 
approximately 1% of the analyte in the medium, 
irrespective of its concentration. This implies, for 
example, that the introduction of ]0, 300. or ] 000 
antibody molecules into a medium containing 
billions of analyte molecules will result, in each 
case, in virtually identical fractional antibody 
binding-site occupancy, the upper limit of anti- 
body concentration being determined by the 
antibody affinity constant. (An antibody concen- 
tration of 0.01 x 1/K is a hundred-fold* less than 



that (1 x 1/K) necessary to bind 50% of a 'trace' 
amount of analyte (see Fig. 8), claimed by Berson 
and ^ alow (1973) as maximizing assay 'sensitiv- 
ity (i.e. the slope of the dose-response curve 
when expressed in terms of bound/free labelled 
analyte). This false conclusion has subsequently 
become incorporated into the mythology of 
radioimmunoassay design which, regrettably a 
majority of kit manufacturers continue to accept ) 
The ambient analyte assay concept was origi- 
nally exploited in the original development of 
what has come to be known as 'two-step' free 
hormone immunoassay (Ekins ex aL y 1980) but it 
is clear that it is of far wider application, and can 
in particular, be utilized in the construction of 
immunosensors and immunoprobes. One such 
example is a probe for the measurement of 
salivary steroids that is currently being developed 
in our laboratory. Comprising a small antibody, 
coated plastic 'dipstick' comparable in size and 
shape to a clinical thermometer, this device is 
intended to permit the measurement of salivary 
steroid levels without requiring the collection off 
saliva. However, the concept also underlies our 
approach to multi-analyte immunoassay also 
under development in our laboratory. 
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Figur 8. Fractional antibody binding-site occunanrv in * 
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concentration extending over several orders of magnhu^ unaffected by variations in ant.body 

other competitive' immunoassays are commonly SesSLu^ 9 ^^ V V ' 1 N ° tG thal radi °'™noassays and 
{implying b 0 > 30%). in accordance with the precepts of ^ concentrates approximately 0.5/K-VK or above 

oerson and Yalow (e.g. Berson and Yalow. 1973). 
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MULTI-ANALYTE 'RATIOME7RIC 
IMMUNOASSAY SYSTEMS 

The concepts relating to ambient analvte im- 
munoassay and assay sensitivity outlined above 
are both exploited in our present development of 
a random access, multi-analyte, immunoassay 
technology capable of measuring, in the same 
small sample, virtually any number of individual 
analytes from selected analyte 'menus' (e.g. a 
hormone menu, viral antigen menu, an allergen 
menu, etc.). Many examples of a need to measure 
a multiplicity of different analvtes in the same 
sample exist in medical diagnosis, for example, in 
the routine diagnosis of thyroid disease, where it 
is frequently necessary to 'measure a number of 
different hormones and thyroid-related proteins 
At present, clinicians frequently experience diffi- 
culty in deciding on the best sequence of tests to 
arrive at a correct diagnosis. Such problems 
would be overcome were all relevant analvtes 
measurable at a cost comparable to the cost of 
measurement of a single substance. Our own 
immediate objective is the development of a 
technology permitting the measurement of com- 
plete 'hormone profiles' using a single small blood 
sample. However, the need for 'multi-analvte', or 
'random access' measurement is not confined to 
medical diagnosis: it also arises, for example, in 
the pharmaceutical industry (where there exists a 
requirement to ensure the purity of protein drugs 
synthesized by recombinant DNA techniques), in 
the food industry and elsewhere. Though still' at 
an early stage, our approach to the achievement 
of this objective can be briefly indicated. 

Multi-analyte assay: general principles 

As discussed above, the notion of ambient 
analyte assay simultaneously introduces two 
extremely important and novel concepts: (a) that 
an estimate of analyte concentration can be based 
upon the use of an infinitesimal amount of 
'sampling' antibody, and (b) that such an estimate 
derives from a direct measurement of fractional 
antibody occupancy by analyte, irrespective of 
the exact amount of antibody used. It should be 
emphasized that the latter proposition is valid 
only in the context of ambient analyte assay, and 
is not true in current conventional immunoassay 
systems (in which fractional antibody occupancy 
depends both upon the amount of antibody in the 



system, and sample volume— see Fig. 8). In short 
exposure of a small number of antibody mole- 
cules (in the form, for example, of a 'microspot' 
located on a solid support) to an analyte- 
containing fluid results in occupancy of antibody 
binding sites in the microspot reflecting the 
analyte concentration in the medium. Following 
such exposure, the antibody-bearing probe may 
be removed and exposed to a 'developing' 
solution containing a high concentration of an 
appropriate second antibody directed against 
either a second epitope on the analyte molecule if 
this is large (i.e. the occupied site), or against 
unoccupied antibody binding sites in the case of 
small analyte molecules (see Fig. 3(b)). (Note: an 
antibody simulating antigen, and reacting with 
unoccupied binding sites, is described as a 
'mirror-image anti-idiotypic antibody'; the use of 
such an antibody instead of labelled antigen is 
convenient but not essential, and is suggested 
here merely to simplify illustration of the basic 
concepts involved.) 

Subsequently, an estimate of binding-site occu- 
pancy of the 'sampling* (solid phase) antibody 
located in the microspot may be derived by 
measurement of the ratio of signals emitted by the 
rwo^ antibodies forming the dual-antibody 'coup- 
lets'. This can be conveniently achieved by- 
labelling the 'sampling' and 'developing' anti- 
bodies with different labels, for example, a pair of 
radioactive, enzyme or chemiluminescent mar- 
kers. Fluorescent labels are nevertheless particu- 
larly useful in this context because, by the use of 
optical scanning techniques, they permit arrays of 
different antibody 'microspots' distributed over a 
surface, each directed against a different analyte, 
to be individually examined, thus enabling 
multiple assays to be simultaneously carried out 
on the same small sample. Fig. 9 illustrates these 
basic ideas, and Fig. 10 such an array. 

Microspot immunoassay sensitivity: 
theoretical considerations 



The notion that it is, in principle, possible to 
measure an analyte concentration using a micros- 
pot of antibody comprising a number of antibody 
molecules in the range ca lO'-lO 6 is likely, at first 
sight, to appear surprising, and may, indeed 
provoke scepticism regarding the assay sensitivi- 
ties potentially attainable using this approach 
Clearly a number of factors, such as the sensitivity 
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of the signal measuring equipment, the density of 
antibody molecules on the surface of the solid 
support, etc., are likely to play a part in 
determining final assay sensitivity. Such factors 
are in turn, dependent on the efficiency with 
which the particular labels used can be detected 
the adsorption properties of antibody supports' 
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Figure 10. 'Multi-analyte' antibody array. Each antibody 
microspot' represents a Vanishingly small' amount of 
antibody directed against an individual analyte. 



etc. Though these are obviously variable, reason- 
able estimates can be made of the order of 
sensitivities likely to be achieved on the basis of 
some simple theoretical calculations. To clarify 
the following discussion, it is assumed that 
sensing' antibody can be uniformly and consis- 
tently coated on a solid matrix at a standard 
density implying that only the 'developing' 
antibody need be labelled and measured in order 
to ascertain fractional occupancy of sensing 
antibody binding sites. 

Fig. Ji illustrates the surface of an antibody 
microspot, of surface area A^m 2 ), and (uniform- 
ly) coated with antibody of affinity K(UM) in a 
monomolecular layer of density D(molecules/ 
um ). Let us assume that the spot is exposed to an 
analyte-containing medium of volume v(ml), and 
containmg an analyte concentration C molecules/ 
ml. I he molecular concentration of antibody in 
the system is thus given by AD/v. (Note: the fact 
that antibody is situated on the surface of a solid 
support, and not evenly distributed throughout 
the medium, does not affect the extent of analyte 
binding at thermodynamic equilibrium, assuming 
that antibody binding sites are not impeded in 
their reactions and have not been damaged durine 
the coating process.) 

Meanwhile, fractional occupancy (F) of anti- 
body binding sites by analyte (at equilibrium) is 
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given by the equation: 

F 2 - F{\lq + + 1) = o (1) 

where p ~ analyte concentration, q = antibody 
concentration (both expressed in units of l/K). 

Thus, for antibody binding site concentrations 
-» 0 (i.e. ? < 0.01), F - /?/(] + p) ; (see Fig. 8). 

Likewise, the fraction of analyte bound by 
antibody (/) at equilibrium is given by the equation: 

f 2 ' /(1/p + qlp + 1) + qlp = 0 (2) 

Thus, for analyte concentration -> 0 (i.e. p < 
0.01),/* ql{\ + q)\ (see Fig. 8). Furthermore, 
when q < 0.01, and when p ^ 0, / < 0.01. 

Expressed in units of l/K; the concentration (<?) 
in the assay of Sensing' antibody situated on the 
microspot is given by DAK/(v x 6 x 10 20 ), (since 
Avogadro's constant, expressed as the number of 
molecules/mmol, is 6 x 10 20 (approximately)). 
The fraction of an analyte concentration — ► 0 
which will be bound to the spot is therefore 
DAK/(v x 6 x 10 20 + DAK), implying that the 
number of analyte molecules bound to the spot is 
given by vCDAK/(v x 6 x 10 2 ° + DAK). 

Case 1: sandwich (two-site) assay. Following 
incubation of sample with antibody, we assume 
the sample is removed, and the microspot then 
exposed to a volume V{m\) of a solution of a 
second, labelled, 'developing' antibody of affinity 
K* (L/M) at a concentration given by Q 
(expressed in units of l/K*). 



The fraction of analyte bound by labelled 
antibody (P) at equilibrium is given by the 
equation: 

r 2 - r(\/p + qip + i) + qip = o (3) 

where P represents the analyte concentration in 
the developing-antibody solution, expressed in 
units of l/K\ i.e. vCDAKKVUv x 6 x 10 20 + 
DAK)V x 6 x lO 20 ]. 

Assuming P < 0.01, F* = Q/(l + Q). (For 
example, if Q = 1, the fraction of analyte 
molecules bound by labelled antibody = 0.5 
approximately). Thus, since the number of 
analyte molecules bound to the spot is given by 
vCDAKI{v x 6 x 10 2() + DAK), the number of 
analyte molecules labelled by the second, de- 
veloping, antibody is given by vCDAKQ/[(v X 6 
x I0 2n + DAK){\ + Q)] n and the surface density 
of such molecules is given by vCD KQI[(v x 6 x 
10 20 + DAK) (1 + 2)1. Moreover, assuming that 
DAK < v x 6 x 10*° (i.e. that the amount of 
antibody in the system is such that 'ambient assay' 
conditions prevail, then the surface density (D*) 
of developing-antibody molecules = CDKQ/[(6 
x 10 20 )(1 + Q)] approximately. It should be 
noted that D* is independent of both v and V, 
also that the ratio DVD = C x KQ/[(6 x 10 20 )(1 
+ Q )] = C x constant. 

If the minimum detectable surface density of 
developing-antibody molecules (i.e. OpH, the 
standard deviation of the measurement of D* 
when C = 0) is given by D^ in (molecules/jxm 2 ) 
an d C min represents the minimum detectable 
analyte concentration in the test sample, then, 
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disregarding non-specific binding of developing 
antibody within the microspot area, 

C min = D* min x [(6 x lO 20 )^ + Q))/DKQ(4) 

For example, if Q = 1 , D = 10 5 molecules/^m 2 , K 
= 10 11 L/M and D^ in = 20 molecules/fim 2 , then 
C min = 2.4 x 10 6 molecules/ml = 10* 15 M/L. It 
should be noted, in this example, the fractional 
occupancy of the sensing antibody binding sites 
by the minimum detectable analyte concentration 
is 0.04%. 

Case 2: anti-idiot} pic antibody ('competitive') 
assay. In this case, we assume that, following 
removal of the sample, the microspot is exposed 
to a volume V(ml) of a solution of (for example) a 
second, labelled, anti-idiotypic antibody reacting 
with unoccupied sites on the sensing antibody. 
Using similar reasoning as above, we may 
likewise assume that the fraction of such sites 
which become occupied by the anti-idiotypic 
'developing' antibody is given by Ql(\ + Q)^ 
where Q is the developing-antibody concentra- 
tion. However, the minimum detectable surface 
density of anti-idiotypic antibody is not, in a 
competitive design, the critical determinant of 
assay sensitivity; this parameter is essentially 
governed by the precision of the density measure- 
ment. 

From Eq. (1), the fraction of sites unoccupied 
jy analyte = 1/(1 + /?), and the fraction occupied 
yy anti-idiotypic antibody = QI(\ + p)(] + Q) m 
Fhus, if the CV in the measurement of anti- 
diotypic antibody is e, the standard deviation is 
0/0 + p)0 + Q)- This term also represents the 
JD in the estimate of the fraction of sites occupied 
>y analyte. Since the total number of antibody 
>inding sites in the spot is DA, the SD in the 
stimate of occupied sites as p -> 0 (i.e. oD<*) 
pproximates WAQI{\ + 0; the SD in the 
»ccupied site surface-density estimate is thus 
DQI{\ + Q). But the SD in the measurement of 
ractional binding-site occupancy when p — ► 0 
efines D min , and hence the minimum detectable 
nalyte concentration in the test sample as 
idicated in Eq (4). 
Thus 

: min = D min x [(6 x 10 20 )(1 + Q))IDKQ (5) 

= zDQIQ + Q) ± [(6 x 10 20 )(1 + Q)) 

DKQ (6) 

= zIK x (6 x 10 20 ) (7) 



For example, if values of Q = \ y £) = \tf 
molecules/^m 2 , and K = 10 M L/M are assumed as 
in the non-competitive example considered 
above, and the CV in the measurement of 
anti-idiotypic antibody density in the microspot is 
1% (i.e. e = 0.01), then D m ] = 500 molecules/ 
\im , and C min = 6 x 10 ) molecules/ml = 
10" 1 M/L. Fractional occupancy of the sensing 
antibody binding sites by the minimum detectable 
analyte concentration is, in this example, 1%. It 
should be noted that the sensitivity limit of dK 
(expressed in molar terms) is identical to that 
previously established for conventional 'competi- 
tive' assays (Ekins and Newman, 1970), and 
which underlies the predictions represented in 
Fig. 4. 

Such considerations appear to suggest (a) that 
microspot assay sensitivities superior to those 
obtainable by conventional radioisotopically 
based immunoassays are achievable, and (b) that 
sensitivities yielded by non-competitive microspot 
assays are likely to be considerably greater than 
those of corresponding compevitive microspot 
assays. It must be emphasized, however, that, 
though such predictions are likely to prove 
correct, assumptions regarding the performance 
of the labels and signal-measuring instrument 
used are incorporated in the simple theoretical 
analysis discussed above. Such factors are clearly 
of importance in determining overall microspot 
immunoassay performance. 



Practical implementation 

The concepts discussed above are clearly exploit- 
able using a variety of antibody labels, including 
chemiluminescent labels; however, our prelimin- 
ary studies have been based on the use of 
conventional fluorophores, since the technology 
of simultaneous measurement of dual fluoresc- 
ence from small areas is already well established. 
Because this volume centres on chemiluminesc- 
ence, we shall provide only a brief indication of 
our initial experimental work in this area, which is 
currently based on the use of commercially 
available confocal microscopes. 

Instrumentation: the laser scanning confocal 
microscope. In laser scanning confocal fluoresc- 
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ence microscopy, a small area of the specimen is 
illuminated by a focused laser beam; the fluoresc- 
ence photons emanating solely from this area are, 
in turn, focused onto a photon detector. Both the 
intensity of illumination and the efficiency of light 
collection diminish rapidly with distance from the 
focal plane (Fig. 12). At the 'confocal' point, the 
projection of the illumination pinhole and the 
back-projection of the detector pinhole coincide. 
Such systems contrast with conventional epi- 
fluorescence methods, where the specimen is 
exposed to an essentially uniform flux of illumina- 
tion (White etal., 1987). 

Sensitivity of current instruments. Typically, 
fluorescence photons emanating from the laser- 




Figure 12. Principle of the confocal microscope. Illuminating 
light is focused at a point in the focal plane. Reflected light 
from this point is focused onto a detector. A complete 
two-dimensional image of structures within the focal plane is 
obtained by scanning the selected area of interest, and may 
be stored in a microcomputer for video display 



illuminated area are detected by a low dark- 
current photomultiplier. Electrons spontaneously 
emitted by the photomultiplier photocathode 
contribute to the background signal of the 
instrument, and must, for highest sensitivity be 
minimized. Fortunately the overall design of such 
instruments permits the photomultiplier photo- 
cathode to be of very small area, so that this 
particular source of background noise is not only 
small, but can be expected to reduce in relative 
importance with future improvement in photo- 
multiplier design. Meanwhile current instruments 
already display very high sensitivity of detection 
of fluorescent signals. For example, the confocal 
microscope manufactured by Zeiss is claimed to 
display a lower detection limit for fluorescein of 
about ten molecules/um 2 (Ploem, 1986). Most 
commercially available FITC-iabelled IgG attains 
a fluorophore/protein molar ratio of -4- thus the 
detection limit (D* in ) of the Zeiss microscope is 
-2-3 FITC-labelled IgG molecules/u.m 2 . This 
implies an analyte-concentration detection limit 
of -2.4 x ]0 molecules/ml for a two-site assay, 
assuming the same parameter values as used in 
the examples discussed above, or 2.4 x 10* 

1C? 2 l!m S/m ' US ' ng 3 Sensing ' antibod y of affinity 
Another comparable instrument is the Bio- 
Rad/Lasersharp laser scanning confocal micro- 
scope, which we are currently using in the 
development of 'ratiometric' multi-analyte assay 
methodology in accordance with the principles 
outlined above (see Fig. 13). The argon laser in 
this system possesses two excitation lines at 488 
and 514 nm. It is thus particularly efficient for the 
excitation of blue/green emitting fluorophores 
such as FITC (which displays an excitation 
maximum at 492 nm). However, it is considerably 
less efficient in the excitation of red-emitting 
fluorophores such as Texas red (excitation max- 
imum 596 nm). However, the ratiometric im- 
munoassay principle permits considerable varia- 
tion in detection efficiencies of the two labels 
relied on since, inter alia, the specific activities of 
the two labelled antibody species forming the 
antibody couplets can be chosen to yield optimal 
signal ratios in the region of unity. Thus 
inefficiency of the argon laser in exciting red 
emitting fluorophores is not necessarily a major 
handicap in the present context. 

Though the current Lasersharp instrument 
relies on a conventional microscope rather than a 
purpose-designed optical system (and appears to 
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?hZT ny 'rJ S a k nt,c, P ated to become available 
shortly Th,s photometer is claimed to be highly 

■ " T 66 USCd in l ^"smission and 
reflection modes, and as a highly sensitive 
fluonmeter. The measuring field can be varied 
snape and sjze for optimum adjustment to the 
specimen structure. More generally, the technol- 
ogy of sensmve light measurement is improving 
rapidly ,„ response to needs in astronomy £f 
space program etc., such technology clearly be n£ 
readily exploitable in a multi-analvte immuno 

SdS e with u t 8 h Iight ; genera ^ ~* 

accordance with the broad principles presented 



Beam 
Splitters 



Objective 



Antibody array 



Antibody microspot 



np 9 rm£-™ * " Chamel Confocal f| u°'«eence microscope 
permitting simultaneous measurement of the fluorescence 
signals from two fluorophors situated at the focal poi£ t Bv 
scanning the antibody array, the ratio of signals rSm each 
antibody m.crospot may be determined 

be less sensitive), it permits quantification of 
fluorescence signals generated from microspots of 
selected area. Initial studies have revealed that 
under conditions that are not necessarily optimal' 
the instrument is capable of detecting approx-' 
imately twenty-five FJTC-labelled JgG molecules/ 
M-m , scanning an area of ~50iim f (Fie 14) if 
must be stressed that neither of these confoca 
microscopes are designed specifically for routine 
ratiometric multi-analyte immunoassay use and 
«t can be anticipated that future instruments 
constructed specifically for this purpose are likelv 
to prove both cheaper and more sensitive. 

Other instruments. The MPM 200 Microscope 
Photometer manufactured by Zeiss of West 



Sol,d antibody supports. On the basis of the 
theoret.cal considerations discussed above it is 
evident that solid antibody supports for milt* 

; te rK U TJ USC shou 'd display a capac- 
ity to adsorb a high surface density of antibody 
combined w.th low intrinsic signal-gene bating 

eXample ' ' OW imrinsic fluor «" 
encej, thus minimizing background. We have 

examined a number of materials, including 

lose P Sh ne ' Tefl °!l' Ce,,U ' OSe and nitroc eTu 8 
Jose membranes and microtitre plates (clear 
polystyrene plates from Nunc; black, wnite and 
clear polystyrene plates from Dynatech with- 
these cntena ,n mind. White Dynatech Micro- 
fluor microtitre plates, formulated specially for 
the detection of low fluorescence signals ykld 
high signa -to-no.se ratios and have therefore 
studies Pr0V1S,0na,,y USCd " ° Ur d -elo P menta1 

IZeZ?Z SUy ° f a Kl ib ° dy COa,in 8' Preliminary 
experiments using M.crofluor plates have re- 
vealed that ,t is possible to coat them wth 
antibody at a surface density of a, least 5 x $ 

an^hoJ CU 'f S/ T (Fig - 15) - Moreover ne "ly all 
antibody molecules so deposited appear to retain 

immunological activity (Fig. 16). 

Verification of the 'ratiometric' imunoassay con- 
cept. Our primary intention, in initial studies has 
been establishment of the basic conditions which 

on rhL Pa 7 ICU , lar ,nstrument < "n be anticipated 
on theorefcal grounds to yield high assay 
sens.t,v,ty Though the setting up of individual 
microspot immunoassays has thus appeared \o u 

Les nf Se n° ?'r P ° rtanCe duri "g the initial 
stages of our studies, we have nevertheless 
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Figure 16. Surface density of immunoreactive IgG molecules (number of molecules/jim 2 ) plotted as a function of the total 
surface density of IgG (number of molecules/ujn 2 ) on Dynatech Microfluor white microtitre plates 



thought it useful to confirm the validity of our 
general concepts by comparing the performance 
of certain assays when constructed in microspot 
format and when conventionally designed. For 
example, we have compared a dual-labelled 
tumour necrosis factor (TNF) ratiometric assay 
system using Texas red and FITC-labelled anti- 
bodies with an optimized 1RMA system using 
identical antibodies but with the second antibody 
I25 I-labeIled. Although unoptimized, the 
ratiometric microspot assay yielded formal sensi- 
tivity values closely approaching that of the 
conventional, optimized, IRMA. Although 
verifying the general concepts underlying 
ratiometric microspot immunoassay methodolo- 
gy, further work is required to achieve the 
considerably greater sensitivity that theory pre- 
dicts as achievable using optimized reagent 
concentrations and improved instrumentation. 



CONCLUSION 

As indicated above, differentiation of the fluores- 
cent signals yielded by two fluorophores can be 
readily achieved solely on the basis of wavelength 
differences, and this approach has been relied on 
entirely in our preliminary studies. However, 



other physical techniques exploiting differences in 
decay time of two or more fluorescence emissions 
(using, for example, a pulsed or sinusoidally 
modulated laser source, and time- or phase- 
resolving detectors) are available, and can be 
expected both to further reduce background and 
to improve signal resolution, thus increasing assay 
sensitivity and precision. These considerations 
aside, the basic technology involved closely 
resembles that employed in domestic compact 
disk recorders and other similar data-storage 
devices, the obvious difference being that light 
emitted from each of the discrete zones forming 
the antibody-array is fluorescent rather than 
reflected, and yields chemical rather than physical 
information. Indeed, our preliminary studies 
suggest that highly sensitive immunoassays using 
antibody microspots of surface area approximat- 
ing 50 nm 2 are achievable, implying that some 
2,000.000 different immunoassays could, in prin- 
ciple, be accommodated on a surface area of 
1 cm 2 . Though non-specific binding of a multiplic- 
ity of developing antibodies would probably 
prohibit the use of antibody arrays of this order, it 
is evident that the technology is capable of 
encompassing analyte numbers of the kind likely 
to be useful in practice. 

The development of multi-analyte assay sys- 
tems of this kind can be anticipated to bring about 
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fundamental changes in medical diagnosis and 
many other biologically related areas. Systems 
capable of measuring every hormone and other 
endocrinological^ related substance within a 
single small sample of blood are within technolo- 
gical reach, providing data which, when analysed 
with the aid of computer-based 'expert' pattern- 
recognition systems, are likely to reveal endoc- 
rine deficiences only dimly perceived using 
current 'single-analyte' diagnostic procedures. 
Such systems also provide a means to the 
development of a 'random access' immunoassay 
methodology, permitting the selection of any 
desired test or combination of tests from an 
extensive analyte menu. Clearly the accommoda- 
tion of a wide range of individual immunoassays 
on a small immunoprobe (comparable in its 
overall physical dimensions with a few drops of 
blood) is likely to totally transform the logistics of 
immunodiagnostic testing, and genuinely repre- 
sents, in our view, 'next generation' immunoassay 
methodology. 
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Multianalyte Microspot Immunoassa^-Microanalytical -'Compact Disk" of the Future 

R. P. Ekins and F. W. Cbo 



Throughout the 1970s, controversy centered both on im- 
munoassay "sensitivity" per se and on the relative sensi- 
tivities of labeled antibody (Ab) and labeled analyte meth- 
ods. Our theoretical studies revealed that RIA sensitivities 
could be surpassed only by the use of very high-specific- 
activity nonisotopic labels in "noncompetitive" designs, 
pr feraWy with monoclonal antibodies. The time-resolved 
fluorescence methodology known as DELFiA-<jeveloped in 
collaboration with LKB/Waliac-— represented the first com- 
mercial "ultrasensitive" nonisotopte technique based on 
these theoretical insights, the same concepts being sub- 
sequently adopted in comparable methodologies relying 
on the use of chemiluminescent and enzyme labels. How- 
ever, high-specific-activity labels also permit the develop- 
ment of "multianalyte" immunoassay systems combining 
ultrasensitive with the simultaneous measurement of tens, 
hundreds, or thousands of analytes in a small biological 
sample. This possibility relies on simple, albeit hitherto- 
unexploited, physicochemical concepts. The first is that all 
immunoassays rely on the measurement of Ab occupancy 
by analyte. The second is that, provided the Ab concentra- 
tion used is 'Vanishingty small," fractional Ab occupancy is 
independent of both Ab concentration and sample volume. 
This leads to the notion of "ratiometric" immunoassay! 
involving measurement of the ratio of signals (e.g., fluores- 
cent signals) emitted by two labeled Abs, the first (a 
"sensor" Ab) deposited as a microspot on a solid support, 
the second (a "developing- Ab) directed against either 
occupied or unoccupied binding sites of the sensor Ab. Our 
preliminary studies of this approach have relied on a 
dual-channel scanning-laser confocal microscope, permit- 
ting microspots of area 100 yjtf or less to be analyzed, 
and implying that an array of 1 0 6 Ab-containing microspots, 
each directed against a different analyte, could, in princi- 
ple, be accommodated on an area of 1 cm 2 . Although 
measurement of such analyte numbers is unlikely ever to 
be required, the ability to analyze biological fluids for a wide 
spectrum of analytes is likely to transform immunodiagnbs- 
tics in the next decade. 

Additional Keyphrues: ratiometric immunoassays • scanning- 
laser confocal microscope • fluoroimmunoassay 

Immunoassay and other protein-binding assay meth- 
ods based on the use of radioisotopic labels have played 
a mq'or role in medicine during the past three decades. 
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Their utility and importance have derived primarily 
from the structural specificity of many reactions be- 
tween binding proteins and analytes and the detectabil- 
ity of isotopically labeled reagents, the latter endowing 
such techniques with "exquisite sensitivity." Recently 
however, interest has increasingly focused on noniso-' 
topic techniques based on identical analytical princi- 
ples, differing only in the nature of the marker used to 
label the reactant (e.g., antibody or antigen), whose 
distribution between reacted ("bound") and unreacted 
Cfree") fractions constitutes the assay ^response." 

The basic aims underlying this interest can be 
broadly classed under four main headings: 

• avoidance of the environmental, legal, economic, and 
practical disadvantages of isotopic techniques (e.g., lim- 
ited shelf life of isotopically labeled reagents, problems 
of radioactive waste disposal, cost and complexity of 
radioisotope counting equipment), particularly those 
impeding the development of, for example, simple diag- 
nostic kits^or home or doctor's office use; 

• achievement of greater assay sensitivity; 

• "direct" measurement of analyte concentrations by 
use of transducer-based n immunosensors"; 

•simultaneous measurement of multiple analytes 
Cmultianalyte assay"). 

In this presentation I will focus primarily on the last 
of these objectives, using thiB to set out the principles 
underlying our present attempts to develop a new ft min- 
laturized" technology that will permit the simultaneous 
measurement of an unlimited number of analytes in a 
small biological sample such as a single drop of blood. 
However, retention (and, if possible, improvement) of 
the high sensitivities of conventional isotopic tech- 
niques is a basic aim not only of our own studies in this 
area but also of most other endeavors falling under the 
above headings. It is therefore appropriate to preface 
this paper with a discussion of the general principles 
underlying the attainment of high binding-assay sensi- 
tivity. 

Immunoassay Sensitivity: Some Basic Concepts 

Definition of Assay Sensitivity 

The need to establish assay conditions yielding max- 
imal sensitivity underlay the independent construction 
of mathematical theories of immunoassay design by 
both Yalow and Berson (1) and Ekins et al. (2) in the 
course of the original development of these methods in 
the early 1960s. Regrettably, these theoretical studies 

led to a prolonged controversy, arising largely from the 
conflicting concepts of "sensitivity" adopted by the two 
groups (see Figure 1). Briefly, Berson and Yalow, in 
their many publications relating to immunoassay de- 
sign (e.g., I, 3), defined sensitivity as the 6lope of the 
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Fig. 1. The differing concepts of sensitivity and precision underlying 
radioimmunoassay design theories developed by [left) YaJow and 
Berson (e.g., 1, 3) and (right) Bans et el. (2, 4) 
Yetow end Benon define assay A as more sensitive because it yields a 
raaponaa curve o! grMte^ slope. Ekins at al. Mine assay B as more sensitive 
because the Impfecwon of measurement of 2ero ttse (<r 0 ) is less Yalowand 
Berson likewise define an assay system as more precise fl it yields a steeoer 
response curve when data are plotted on a log dose scale 

response curve relating the fraction or percentage of 
labeled antigen bound (b) to analyte concentration ([H]). 
In contrast, Ekins et al. (e.g., 2, 4) defined sensitivity as 
the (imprecision of measurement of zero dose, this 
quantity being indicative of, and essentially equivalent 
to, the lower limit of detection. 

The key difference between these two definitions 
clearly lies in the dependence of the assay detection 
limit on the error (imprecision) in the measurement of 
the response variable. By neglecting this crucial factor, 
the "response curve slope" definition leads to many 
obvious absurdities. For example, plotting conventional 
RIA data in terms of the response metameter B/F (i.e., 
the bound to free ratio) suggests that assay "sensitivity*' 
is increased by increasing the antibody concentration in 
the system; however, the converse conclusion is reached 
if identical data are plotted in terms of F/B' (see Figure 
2). Observation of the shape and slopes of response 
curves without detailed error analysis thus constitutes a 
totally misleading guide to optimal immunoassay de- 
sign. This approach has, however, characterized many 
of the studies conducted in the immunoassay field dur- 
ing the past 30 years, and has been the source of much 




Any plot 




Response curve slope Detection limit 



Fig. 2. Schematic representation of RIA dose-response curves 
observed for high and low antibody concentrations plotted in terms ot 
{left) the free/bound fraction (F/B); (csnter) the bound/free fraction 

(B/F) 

Note that the low antibody concentration yields a response curve of greater 
slope when the a*wy response is plotted fn terms of F/B. but of lower sJope 
when plotted In terms of B/F. The precision of measurement of 2wo dose 
(ADo) is Independent of the coordinate frame used to plot assay data (see 

right) 
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mythology. For example, consideration of the Law of 
Mass Action reveals that, when response curves corre- 
sponding to different antibody concentrations are plot- 
ted in terms of b vs [H], the maximal slope at xero doee 
is obtained for a concentration of QAfK (where IT is the 
affinity constant), in which circumstance the zero dose 
response (bo) is 33%. This conclusion led to Berson and 
Yalow's enunciation of the well-known dictum (which, 
albeit erroneous, is broadly adhered to by many immu- 
noassay practitioners and kit manufacturers) that, to 
TnaTimi7i» RIA sensitivity, the amount of antibody to use 
in the system is that which binds 33% of labeled antigen 
in the absence of unlabeled antigen (I, 3). 

Disagreement regarding the concept of sensitivity 
inevitably led to prolonged dispute regarding immu- 
noassay design (5). However, although it is still common 
to encounter publications in the field that rely solely on 
the response curve slope as a measure of sensitivity, the 
assay detection limit is now widely accepted as th nly 
valid indicator of this parameter, and we do not there- 
fore intend to dwell further on this issue here. It is 
nevertheless relevant to an understanding of the Vin . 
iaturized" assay methodology described below to empha- 
size that untenable concepts of both sensitivity and 
precision underlie many of the commonly accepted rules 
governing current immunoassay-design practice, some 
of which are contravened in our own approach. 

Basic Immunoassay Designs 

It is likewise important in the present context to 
comprehend the basis of the various types of immunoas- 
says currently in use, and the constraints on the sensi- 
tivities of which they are potentially capable. The radio- 
immunoassay and analogous protein-binding assay 
techniques originally developed for the measurement of 
insulin by Yalow and Berson (6), and of thyroxin and 
vitamin B, 2 by Ekins and Barakat (7, 8\ relied on the 
use of a labeled analyte marker to reveal the products of 
the binding reactions between analyte and binder (Pig- 
ure 3, left). This approach has subsequently often been 
portrayed as relying on "competition" between labeled 
and unlabeled analyte molecules for a limited number of 
protein-binding sites, such assays being frequently re- 
ferred to as "competitive.* 

Subsequently, Wide et al. in Sweden (9), followed 
shortly by Miles and Hales in the U.K. (JO), developed 
labeled antibody methods (Figure 3, right). These meth- 
ods represented an extension of the Habeled reagent* 
methods (utilizing radiolabeled organic compounds such 
as 131 Mabeledp-iodosulfonyl chloride, [ 3 H]acetic anhy- 
dride, and other similar reagents) devised, during the 
early 1950s, by Keston et al. (J J), Avivi et al. (22), and 
others for quantifying amino acids, steroid and thyroid 
hormones, etc. Although radiolabeled antibody methods 
(immunoradiometric assays; irmas) were originally 
claimed (13) to be more sensitive than methods based on 
the use of radiolabeled analyte, these claims were sup- 
ported by neither rigorous theoretical analysis nor per- 
suasive experimental evidence, and for some time re- 
mained controversial. Further doubt on their validity 
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anaJyte ♦ antibody* « analyte : antibody* (B) 

* reslduaJ analyte 

* itaduil antibody* (F) 



Measure "fraction bound* (B) 
Measure "fraction free" (F) 



s F SaL^T ,y,e ^ "*"■«■•* MM assay 

(although the libeled anaryte « rw iMawrtTS^.^ a , nfi L body 

ol which fraction <X the labeled eneiySls ^Z^L^ '™*P«»ve 
•» optimal antibody concent^^^T™^. t^?™ 8 ^ 
•ystem tend* toward zero (aeeufrtnTe taSom^JL * 
anbbooy systems r*y on OtoeSffi S.? °>' Ubeted " 

product, of the binding reaction ^n^T?, ma f ker «° «veaJ the 
the optima! Bn^o^Z^Z^t * e «- 

toward M when the ? n3SS i!" 8 *^ ten * 

Infinity when the bound fr«*cTu de^^^Zt^*^ ^ 
beefcground) w ™" c C*ewis» assuming zero 



was cast by the publication by Rodbard and Weiss in 

*atb<*h labeled analyte and labeled antibody method! 
possessed essentially equal sensitivities. (Note- These 

the assay of smaU polypeptides, in which radioiodine 
incorporation wto the antigen molecule was restricted 
conversely, these assays would be less sensitive for 
measurement of antigens of high molecular mass ) Nev- 
«lheless, despite the appearance of this publication, the 
belief that labeled antibody methods peV se are iSrfn! 
ocally more sensitive than the corresponding labeled 
^methods gained wide acceptation! 

The reason for confusion on this issue is that the 
greater potential sensitivity of certain assay formate is 
not reajly a consequence of the labeling of antibody as 
S!! K 8 ^ 1 , kdeed, the apparent antithesis 
between labeled-analyte and labeled-antibody methods 
Averts attention from the true reasons underlying 
superior sensitivity of certain assay designs. Theoretical 
analysis (see, e.g., 4, IS) reveals that, assuming W 
fect" separation of the products of the brndinTrLction 

^'Z^^f^ af hmai *<* moieties), the 
optimal antibody concentration (for marimal sensitiV 
ty) in a labeled analyte immunoassay mvarteblytenda 

t^'J^T^^ ° { ***** bound labeled 
analyte fraction* measured, whereas in labeled-anti- 
^ methods the optimal antibody concentration de- 
pends on which labeled-antibody fraction is measure 
(see Figure 3). If the free (unreacted) antibody 
measured, the optima] concentration also tendjr/aw 
conversely, if the analyte-bound fraction is measured' 
the concentration tends to infinity. In short, of the four" 
basic measurement strategies available-labeled ana- 
lyte, with measurement of free or bound reaction m™l 
uct, and labeled antibody, also with 
free or b^und product^nly on* Demits, in practice, the 
use of anfabodv concentrations a DD roachin E infinity. 
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This particular approach may, for want of a better term 
r*?^ ^^Petitive," alftoughTmu^S 
e^asued that such terminology invoW. de^ 
from the original meanings attached tc."competitiv? 
and Uncompetitive" when these descripticTw^ fi^t 
used in the present context Indeed, as discussed 
assays may be subclassified in this ra^nerwhenTo 
labeled reagent of any kind is involved 

However, the categorization of immunoassays and 
other binding assays as competitive or noncompete 
pending on the binding agent concentratiTyTS 
maximal assay sensitivity, itself obscures the underiv- 
mg reasons for the existence of this divergence T^y 
designs, and may thus be misleading^ ^2 
may be more readily understood if the basic principles rf 



The "Antibody Occupancy Principle- of Immunoassay 

When i a "sensor" antibody is introduced into an ana- 
lyte-contamtng medium, binding sites on the antibody 

tw I T 1 ^ ttoleculeB to « Actional etot 

Aat reflects both the equilibrium constantioveraS 
the buiding reaction, and the final concentration offiSf 

Jm^^ZHS * ? C nUXtUre - ™» Position stiS 
unmediately from the Law of Mass Action, which can* 



fAbAg)/[£Ab] = .infAg] 



(1) 

««. fractional occupancy of antibody binding sites, 

fAbAgMAb] = KffAgVd + K[fAg]) (2 ) 

where jAbAg], [Ab], [fAb], and [fAg] represent the 
concentrations (at equilibrium) of bound 3SS anti! 
body, and free antibody and antigen (analyte) lesSl 

fration of free analyte generaUy depends on the concen- 
trations of both total analyte and antibody; how^ 
when total antibody approximates 0.05/ff o leT S 
TiZ^r^l^ ^^tions do noTdSTr 
^ b7 y ' ^ of ^ » 



[AbAg]/[Ah] = K[AgV(l + K[Ag]) 



(3) 



hiiT 7 ^ conce Pt have been termed "am- 
bient analyte immunoassays" (iff), fractional occupancy 
being ^dependent of both sample volume and antibodv 
concentration (see below). annoody 
Ml in^unoassays essentially depend on measure. 
ment of the ••fractional occupancy- of the «*n«Hn* 
body after its reaction with analyte (see Figure 4). 

r , e,ying ° n m «««urement of unSaTpied 
antibody bindxng sites (from which antibody occu^cv 

aZnme"? F " te - tt *> ^aTS 

attejnment of maxomal sensifa vity-the use of sensor 
antibody concentrations tending to zero; these ^ 
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JJ. 4. The antibody binding occupancy principle of Immunoas- 



may therefore be categorized as "competitive" Con- 
vereely, techniques in which occupied sites are dinetb 
measured permit On principle) the use of relatively high 
concentrations of sensor antibody and may be described 
as "noncompetitive.- This difference in JSJ 
£nply reflects the proposition that, to »inim£^rm£ 
the measurement, it is generally undesirable to^ 
sure a small quantity by estimating the different 
between two large quantities. Terence 

JSZ ^ iUufltrated * «•» 6. which 

portrays basic immunoassay formats currently in com- 

m0 ^ C °u Venti0nal ^ ^ othe «- ^ar 
anaJyte" techniques rely on measurement of «noc«l~j 
bindmg sites, generally by back-titration (eithT 
teneous or sequential) with labeled analyte but Sti 
idao^pic antibody (reactive only with un£pied £. 
on the sensor antibody) may be used fcTST sa£e 
PUrP T^^? &ae labeled Jntib^ 

iSS* ^ " B ? bo * r it8elf ^tutea thesTns^ 

„ • . *^rou^n uae ox (e.g.; an immunosorbent (com- 
P^anbgen. antigen analog, or anti-idiotypic anti- 

*5A f b ? ? Upport) - tf - separation, the 
signal enutted by Ubeled antibody bouZdZ a^aly£ 
d.e., the occupied" fraction) is measured directrvX 
assay can be classed as "noncompetitive.^ cKS* if 
one majeures the labeled antibody no/ eJSJSg* 
(i.e., that attached to the immunosorbent) the* ♦£? 
assay ia "competitive.* ^ t;> theB "»« 

JVc-site "sandwich" assays are clearly more complex 

^Z„Z n y ? t T 0antibodie3 ^ can be con^i? 
ered from two points of view. For our present purposes 
the sohd-phase antibody can be regarded as tiJvES 
antiWy, with to labeled antibody enabling the^u. 

Seen from this viewpoint, two-site assay, may be 
classed as "noncompetitive." 
These considerations emphasize that the differences 

^Sf^ B<Halled ^^ve aS nT 
competitive methods are essentially unrelated to which 
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S"L^ t f mPed ° Ve and nonca ™*«>*» immunoassay design, 

Wtoiypt^niibody ^^J^t" **> 0f ■*•••*«* 
-occup* by ^ ^ LZSTi S^SSST - 

component (if any) of the reaction system ia labeled, 
kdeed, in the caae of trar^ucer-bJeT^uZ^ 

on tLT ,Un06eMOr *J? ^ si «^cantly, depeS 
on whether a measurable signal is yielded 'by «*3 
or unoccupied antibody binding sites situaledTn S 
^ace In short, the terms •'competitive" and^onc^n 
pctibve merely reflect alternative approaches toX 

J * the optimal antibod? 

concentration required to ininimize the effects of 3 
dom errors arising in the determination 

Competitive and noncompetitive immunoassays «,„ 
be shown to differ significantly in many 

^27^ Ct£n3t1 ^ j*"** thefr sensiti^tieTi 
both types of assays, both the affinity constant (fC\ TfflZ. 
antibody and the specific activity 
tent in determining sensitivity; however, in practicl 

by the affinity constant of the antibody, whereaTS 
specie activity of the label is more importantTnon! 

KJ'.TT ^ casea, the Experimental" 
or -manipulation" error in the measurement of th* 
zartHiose response ay [i. e . ( the relative ZTlal^ 
ansmg from pipetting and other operatic^ bu?S 
including the statistical signal measurement enS £ 
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se] is of key importance in determining "potential" 
assay sensitivity (i.e. t the sensitivity obtained by assum- 
ing the specific activity of the label to be infinite, 
implying aero error in signal measurement). Thus the 
potential sensitivity of a competitive assay can be 
shown to be OCR* whereas that of a noncompetitive 
assay is given by Roo^/IAbJKRo, where, in the latter 
case, Re is assumed to represent the labeled antibody 
misclassified as bound (IbAbU commonly referred to as 
"nonspedfically bound" antibody. Thus R^AL] - f, the 
fraction of labeled antibody that is nonsperifically 
bound, and R^/IAblO^ = fa^/KT^. Assuming that 
the relative error (o^/RJ in the measurement of the 
zero-dose response is approximately identical for both 
competitive and noncompetitive assays, it is evident 
from this simple analysis that the potential sensitivity 
of noncompetitive methods is greater than that of com- 
petrtjve methods by the factor f, i.e., by the fraction of 
labeled antibody that is ^specifically bound." For 
example, if the nonspecifically bound fraction is 0 01% 
a noncompetitive strategy is potentially capable of a 
sensitivity 10 000-fold greater than that of a competi- 
tive approach, other factors being equal. 

These findings are summarized in Figure 6 (left) 
which shows the relationships between sensitivity (ex- 
pressed in terms of molecules per milliliter) and anti- 
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*LL2£2H Pr«toB0 sensigvftles of competitive and non- 
competltiv. immunoassay methods (represented by the SD of zero 
analyts measures*, expressed as moleculesSu.) ptoned £ a 
function ol antibody affinity (K) ' ^ * 

onm. Vt^^t^S^ -^i?^? t * v • •»"!»■ eateuatlone an> baaed 
m!>>i»»!?^ eiperirnantal •nor (CV) incurred In tha maatura- 

mwtolfte asMy reeponee (e-g, traction ol labeled antigen bour^M*^ 

**>*y. Irnprymo <h«t the error m the meaaurernew ot the laiel i»r ee k^T 
The «S*t* curve hdicatei Ihebee In eeraniviry a«^^^^2£ 
•tor mcurrad InoounlinQ «i diaintoortfon, JK^t^T^ 
»«. « uann «ntt>00iaa with an affinity <io« L/mol »»mS^?^lL-5T 
PnMttoa). MO* hcraaM in aensttivfty can te i^^^^^^l 

arvwaho^ratoatovaiu*;^ 

J^J^*^ww«e the much grader Ion in sensttvlty 
(ttmparsd with that potsntsOy attainable) when i radioisotopic martork 

wad. «nd the facial advantages of Mr**** u+^SSJ^LZ ' 

rBduc * d 0.1% or lass). Arms Indicate assay aawjtrSaT^SJw 
nonoompettive r/mwnoaasayi based on ,as l Itfalo rBpo "? d ** 

corrtuwr. i«Jertay the orjpinal devetapment M^TES^JK 



body affinrty » an optimized competitive (labeled ana- 
lyte assay. For tins analysis, we assume (c) the use of a 
label of uulmte specific activity, and (6) the use of »1 as 
a label, the radioactivity of the samples-being counted 
for 1 mm. Computations of the theoretically optimal 
reagent concentrations (on which calculations repre- 
sented m ...Figure 6 rely) were based on the further 
assumptions that (c) the radioactivity of the antibody, 
bound labeled-analyte fraction was counted and Of) the 
(relative) "experimental error" component in the mea- 
surement of the bound fraction (cyb) was 1%. Given 
these assumptions, the "potential" sensitivity attain- 
able in such an assay is oytfb, where K is the affinity 
constant of the antibody. rFor example, if the affinity 
constant as 10" L/mol, and ajb is 0.01 (1%), maximal 
assay senstivity is 10~" mol/L, or -6 x i 0 « molecules/ 
mL.] The additional "signal measurement error" arising 
m consequence of counting radioactive samples for a 
finite time implies a loss of assay sensitivity, as shown 
by the upper curve in Figure 6 (left). However, the 
resulting loss in sensitivity is relatively small for anti. 
todies of affinities <10» L/mol, and is negligible for 
antibodies with affinities <10" L/mol. In other words if 
the essayist can accept individual sample counting 
, T 5 litUe improvement in sensitivity* 
gained b/' using alternative labels of higher soecifie 
activities ; th« «L However, similar co^erSot 
J* 1 "dioiwtopic labels of much lower specific 
actmty than *»I ( e .g., *H) may limit the sensitivities* 
the assays (such as steroid assays) in which they are 
used, notwithstanding the use of relatively long sample 
counting times. 

The other main conclusions stemming from such 
analysis are the importance of both iniriimMng "manip- 
ulation" errors and using antibodies of high binding 
affinity. For example, an increase in aJb to 3% imoliea 
an approximate threefold loss in sensitivity, notwith- 
standing the fact that an assay reoptimized in response 
to the deterioration in operator skill that these numbers 
imply would utilize less antibody and labeled analyte 
thereby partially offsetting the consequences of poor 
pipetting. But the most important conclusion emerging 
from tie analysis is the near impossibility, in practice 
immunoassay sensitivities better than' 
about 10 7 moleculea/mL by using a competitive ap- 
proach, irrespective of the nature of the label used, if one 
assumes an upper limit to antibody binding affinities on 
the order of 10" L/mol. 

The results of a similar analysis of the sensitivity 
umtataoM applying to noncompetitive (two-site) assays 
(25) are illustrated in Figure 6 (right). Two sets of 
curves are portrayed here, corresponding to the assump- 
tions of 1% and 0.01% nonspecific binding of labeled 
ajitibody to the capture-antibody substrate. Such anal- 
ysis likewise yields important conclusions relevant to 
assay design, e.g., th crucial importance of reducing 
nonspecific binding of labeled antibody to an absolute 
minimum. Furthermore, if nonspecific binding is re- 

nW t« ~n.ni*, W M hi*h Kenritivitv « »^; woW o 
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^ncompebt,ve assay des^n as by using an antibody of 
f-10* I^mol in B competitive method One of the m J 
^p^t conclusions is that the sensitivitil*^ 
ta% attainable with high-affinity antibodies 

~ A * ^oiaotopicaUy based 
methods, which (because of the relativplv J!~T 
activities of isotopes such as "»D areSSJT ^ 
to sensitivities of the orierof lO'Tf.^^ ? PraCtlce 
more. In short, although ^ c^£2^ W 
^competitive J^yU^Xt^^l 

of the same antibody in each methodolo^th e ^!f 

with affinity of .bw l^-itf^,* "* rf «*>T»ii« 
•ita. piquet™ of mmJZJ „?k5 S?* 8 * 1,1 

f high-epeofic activity labela (Table 1 ^ 
Against this background, let im * , 

»»»»»—»■ «rf multiply,. 
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f - FfCMAJ,]) ♦ (fAnMAbj) + „ + . „ (<) 



Ambient Analyte Immunoassay 

iMHat 0.5/JT i. rewired to iiS? t ?"" ,n .; i, ! ,m - 



«... IJ^cfcp iim. - **^ to ^ „ ~ 

Sp*CfflC activity 

^^Sf.^nt per second per 
7.5 x io« labeled rfiolecules 

Determined by enzyme "amplifica- 
tw ifactor" and delegability of 
reaction product 

'daeaabte event per «*d 
^- u «ec«ab.eeve« S p wtebeled 



125| 

Enzyme label 

Chemilumlnes^Bnt label 
Fluorescent label 



less than (sav) 0 m /jr /*u , y concentration of 

to « . udnw-uik ^^ffSs^sr 4 : 

bonal) occupancy of antilwfc result Wg (frac- 

fiyatem. (If, for example, « £J5 ^ * *** 
binding-site concentritira of 0 01/ir^ ' " 
10-" mol/L. or 6 0^ St k- X ^ "P^nts 0.01 x 
binding 1, lflT ]«*»» «tes^nL.) Analyte 

small, the resulting reductio?^? T 0UDt bouad " 
tion of analyte iT S5<f "^"^"at concentre- 

<0.0LUT, analyte deEi ?i ******* is 

that Uu. »nv en tio^^r.^r i ^ Uned to ia S 
Tbey do not refer to molar cobc^SL denV "« 

"ffituty (awtant) for analyte^^'iT^r^y P°««»« «n 
10 11 moW, (represented to units of MPliTi concentr « ti on of 
Thu,. fractional occupancTc^ bi2 1 (dan . ""nleaa) unit 
cal for otf antibodies tf thi w^Tf^ °? * <IU " tioB 4 «« Menti. 
tion u, adopts- i. e ., « ^ concent™. 
be identical for ayrtem. Jr''^ ^^ntration will 

antibodj with an affinity ofS 1 lL,Mn^o ron " nt ? ti ™ «f «« 

^tbana^tyofToui^O^oW f M °[^^^J 
affinity of 10» L/mol, etc^roidedS^ *? «» 
expressed in the same manW) concentration is 

P^S^Xi™^"^ ^ oca, 
s ite8 ar« exposed, nTSo^ t ^^S 8Dti ^^ 
> *. the systen, » iad^TJ.^J^ «««b.tion tube; 
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441 Antibody concentration *' 4 

competitive immunoassays art conventionally designed to IL ^*£v£ 
dent of sample volume. 

♦w The !f^ nclu8i0M lead to two concepts First 

the antibody may be confined to a ^crospo? on a soM 

site within the microspot is <uflf x i 0 "« x N, whereT 
: ^.^P 1 * ™ lun " to which the microspot is exposed 
to . aiUfatetf and AT = Avogadn/s numb£ (J x^f 
For example, if. = 1 and* = iq» L/ mo i, aJJJ 




D,- 2 . rt -03 u=3 1/ST . U: 23/ K0. 4862209286 p „ 

n^^uaber of binding sites that will cause neg- 

Tatiometnc," microspot immunoassay. 

Dual-Label Microspot Immunoassay 

After exposure of a microspot of antibody (located ™ „ 

ESM* 10 1! ^^tainin^^F^ 
ure 8, left), tte probe may be removed id exposed tea 
whition 1 ^taming a high concentration ofV^ebp! 
aj- antibody directed against either a second eS 
<xe the occupied site) on the analyte molecule JfE 
molecuJeis large, or against unoccupied birJu^sitesor! 

m the case of Email aiidXmoS^ 

anybody may thus be estimated by measuring the raS 
of sensor and developing antibodies tStto the dS 
SS^ y + K C0Up,etS " ™ B readily achieved by 

«IW ELT"* ^ devd ^ antibodiestiS 
Querent labels, e.g., a pair of radioactive, enzyme, or 

Aem.tanescent markers (or even labels ofemSefr 

duTerent nature) Fluorescent labels are potential? 

tKularly usekl fa this context because by the uW 

optical scanning techniques (Figure 9), they permit the 

ecanning of arrays of antibody Wspots" Kuted 

over a surface (each microspot directed^einet a dtf£ 

ent analyte , so that multiple analyte assays may be 

performed simultaneously on the same sample Several 
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Non-compelltlw assay CompeUtlve assay 

R*. 8. Microspot immunoassay: (left) first Incubation, with 1 : , 
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fl ' — •"^z^Sr^ 

eoaW . monomer *^%^Zg^«'»** 
advantages stem from adopting a dual fluorescence 

field of vjew is important, because the r«ti n ^f*u 

tuabons in the intensity of the incident S'l l 
beam are apt to be of little significance TVm!!) 
tages are additional to the b^!?^'. advan * 
this apprnach, i.e., t£ ^e^^^T^^ 
stancy of the amount of m^S^Sl X 
assay system is removed. 10 the 

Microspot Immunoassay Sensitivity 

osition that microspot assays may be at least M J»n«T 
bve as conventional systems that rely oTfi^i!^* 
amounts of antibody rLy reac% be dLl^f iFF 
consideration of a mod/syetem 
sensor antibody molecules are attached toC^c^f 

antibody is distributed uniform^ titug^e ^ 
bation mature.) Let us also suppose tffij antin^v" 

surface density on the support and (to simplify oW 
aon) are unlabeled, Then a change in the ciStaS 

of sensor antibody implies a corresponding cWe i n 
Je surface area over which the antib^yTdiSTbuted 

vJX ^ tih0dy **** constant is i^' 

L/mol, the total incubation volume i fl i .T.", 1 
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a surface area of 10 5 pm 3 «. q i 

antibody binding sites^r^^ * accommodates 
of 0.1/* ana^f ^^P 00 ^ to a concentration 

, "^J™ at a concentration of O.OLK fie 6 x in* 
moleculea^nL), we measure Wmpetitive^ tL rl 
suJbng^tibody occupancy (ejr bvS^T!l* 

a^l^^ 

^^^^^ 

developing antibody, ^fSTtt^^X^ 
specify- to the solid support iS a?t^T* 
density of 1 molecule/^un 2 . Surface 
We may now consider the effects rf. 

equabon 4, the value of P f A r tv* i 2 . ' Yoia 
in-3 rm. e 01 f ,or the 1 nun 2 area is 4 Sfl * 

area is 2.99 x 10 7 d. e ., about 50% of the total analvJ 
molecules present), whereas the number 
t.body molecules nonspedfically bound is lT 

zero fi- tkT;li 110186 instrument tselftobe 

(, - e ' th€ ^ of background is the no* 
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specifically-bound labeled antibody within the instru- 
ment's field of view), the signal/noise ratio observed for 
the 1 mm 8 area is -30. Similarly, the value of Pfor a 0 1 
mm 8 area ia 9.02 x 10"», the number of labeled anti- 
body molecules specifically bound to the area is 5.41 x 
10 6 , the number nonspecifically bound is 10*, and the 
signal/noise ratio ia -54. Likewise, the signal/noise 
ratio for a 0.01 mm 8 area can be shown to be -59. In 
short, the signal/noise ratio increases as the antibody- 
coated surface area is decreased, approaching a man. 
mal (plateau) value of 60 as the area coated with sensor 
antibody falls below 0.01 mm 8 and tends toward zero 
If, however, a reduction in the antibody-coated area 

r i ! 1 , ^ CCOmpanied by , 8 ******** reduction in 
the detecting instrument's field of view, the resulting 
reduction in "signal" would nor lead to a corresponding 
decrease in the background generated by non^edfi! 
cally-bound developing antibody (Figure 10b). TW 
fore, although reduction in the coated area would in- 
crease the fractional occupancy of the sensor antibody 
the signal/noise ratio might either remain constant or 
fell. In these circumstances it might be advantageous to 
ncrease the coated area. Similarly, if the surface den- 
fflty of sensor antibody were decreased (the coated area 
bemg held constant), similar conclusions woukTbl 
reached (Figure 10c). 

twJ^S: * ^ back f™** «Pul generated within 
the detecting instrument itself (e.g., from the photocath- 
ode of a photomultiplier tube used to detactphoW 
emitted from the antibody-coated area) were not ^ 
and remamed constant regardleBs of the instrument's 

flljll^' ****** «^oiae ratio would 

also be attained at some optima] value of the antibody, 
coated area, below which the ratio would fall Because 
however, one can generally reduce the size of the detector 
(and hence the detector-generated background) at the 
same rate as the size of the signal-emitting area, there is 

*J e T^ pri ^ le T for ««na^noiae ratio to 
« the.antftody-coated area is progressively 
reduced toward zero. Thus ifwe.accept the sigrXri.; 
raboas indicative of the precision of the measurement of 
antibody occupancy (and hence of assay sensitivity) 
Aese considerations suggest that it is advantageous ,to 
reduce the antibody^ted surface area 
tantly, tte sensor-antibody concentration) toward zero, 
ahhough little advantage ia likely to accrue from r£Z 
"* ***** Wow 0.01 mm 8 (and thus the antibody 
concentration below 0.01/JE). y 
Were the microspot area indeed reduced to zero, both 
signal and noise would likewise also fall to zero (the 
ratio between them nevertheless remaining essentiallv 
constant), implying that no signal of any kind would? in 
we Imut, be recorded. In practice, other statistical 
actors come into play when the number of individual 
events (e.g., photons) observed by a detecting instru- 
ment is very low, thus prohibiting a reduction of the 
*™™ hh ?y cancentntian to zero. The point at 
which the reduction in the antibody<oated areTcauses 
tK. .i™, to be lort aufficientlv to affect 



precision of the measurement of antibody occupancy 

ix? ^ y on ** spedfic **** «**ssz 

anybody used to measure the occupied binding sites: the 
higher the specific activity, the smaller the^nniasible 
area. Thus, given labels of very high specificity 
one can envision circumstances in which, even in a 
Noncompetitive- system, the optimal concentration of 
sensor antibody may be exceedingly low. A more sen- 
era! conclusion is that a variety of factors, including : the 

ttfJtSf 08 °l the inst ^ meDts »r meaning 
the labeled antibody (or labeled analyte), influence 
unmunoassay design, implying, among other things, the 
virtual impossibility of formulating general rules re- 
garding this. For example, reagent concentrations that 
are optimal for isotopically labeled reagents used with a 
conventional radioisotope counter (possessing a fixed 
background dependent on its basic construction) are 
hkely to be entirely different when very high-specific! 
activity labels are used and one has th e freedom totSlor 
the measuring instrument to samples of any size. In 
abort, certain conclusions based on experience of RIA 
and rflMA techniques may prove misleading when ap- 
Plied to nomsotopic methodologies, and should be 
viewed with caution. 

A more detailed theoretical consideration of (noncom- 
I este that™'* 061 ' 01 immmjoas8a y sensitivity (21) sug- 



*m* * t(6 x lO^Xl + [Ab*])J/DJf[Ab»] (5) 

ar,ti£fv ***** «te*W) of sensor 

antibody, K = sensor antibody affinity (Unol) fAb'l » 
concentration of labeled antibody in developing solution 

^ZTL? ° {yK *> where ** " labeled antibody 
^*>>? <** = minium detectable surface density 
of labeled antibody (molecules/Mm 8 ), and C^- S 
detection limit (moleculea/mL). For example?* 1X6*1 - 
U> = 10'molecdesW,*- 10 » iaJ^JjFj Z 
20 molecules/^ then - 2.4 x l 0 « molecule^mL 
wi- V ^ aDd ^onal occupancy of the 
bmding sites of the sensor antibody by the minimum 
detectable concentration of analyte is 0.04%. Figure 11 
shows the theoretical assay sensitivities attainable with 

zs^sc 0 ^ of variom piotted m * 

A similar theoretical analysis of competitive micro- 
spot immunoassay indicates that potential sensitivities 
are essentially identical to those attainable S?£ 
ventiona] competitive methodologies. In summary, the 
above considerations indicate that the attainment of 
high microspot assay sensitivity requires close packing 
of molecules of sensor antibodies within the microspo? 
area, coznbined with the use of an instrument capable* 
accurately measuring very low surface densities of de- 
velops antibodies..Tney also suggest that (a) micro, 
spot assay aens,favities considerably higher than those 
obtainable by conventional isotopically based immu- 
noaasays are achievable, and (6) if labels of very high 
specific activity are available, the sensitivities rteldri 
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situated within the Scro^^ of antibody 
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homogeneous h^d-phaWe^^tz^ °t" nr * » a 
effective concenSbon^e^n^-* 
bon medium is exceedii^^^Jf incuba - 
wbch sensor antib^Z&^i^ 0 ^ rate « 
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» mind the relationship B5£ £2 ^ b " ri »« 
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to diminish with future improvement in photomultiplier 
design. Other sources of background include fluores- 
cence emitted by components in the optical system, 
which may not, in current instruments, have been 
constructed with background reduction as a prime con- 
sideration. Nevertheless, they detect with high sensitiv- 
ity fluorescent signals. For example, one commercially 
available microscope is claimed to detect fluorescein at a 
density of 10 molecules'/urn 2 . Most commercially avail- 
able fluorescein isothiocyanate (FTTOlabeled IgG ex- 
hibits a fluorophor/protein ratio of -4; this implies 
detection limit (D*^) for antibody Burface density of 
two or three FTTC-labeled IgG molecules per microme- 
ter 3 . This, in turn, implies a theoretical sensitivity for a 
two-site immunoassay of x lo 5 analyte molecules 
per milliliter, assuming identical parameter values as 
above, or 2-3 x 10 4 molecules/mL if the sensing anti- 
body has an affinity of 10" L/mol. Clearly, sensitivity 
may be increased by loading more fluorophor either 
directly or indirectly onto the antibody. 

Our preliminary studies have relied on a less sensi- 
tive microscope, albeit one possessing facilities for dual- 
fluorescence measurement Its argon laser emits two 
excitation lines at 468 and 514 nm. It is thus particu- 
larly efficient in exciting blue/green-emitting fluoro- 
phores such as FTTC (excitation maximum 492 nm), but 
is less efficient in exciting fluorophores such as Texas 
Red (excitation maximum 596 nm). However, the ratio- 
metric assay principle permits considerable variation in 
detection efficiencies of the two labels because the spe- 
cific activities of the labeled antibody Bpecies forming 
the antibody couplets can be chosen to yield signal 
ratios approximating unity. Inefficiency of the argon 
laser in exciting Texas Red is thus not a m^jor handicap 
in this context. Though this instrument relies on a 
conventional microscope and not on an optical system 
designed for this purpose (and thus implicitly less sen- 
sitive), it permits quantification of fluorescence signals 
generated from microspota of any selected area. Initial 
studies have revealed that, under conditions that are 
not optimal, the instrument is capable of detecting -25 
FTTC-labeled and (or) 150 Texas Red-labeled IgG mole- 
cules per micrometer 2 , while scanning an area of -50 
Mm 2 . 

The development of microspot immunoassays has also 
necessitated closer scrutiny of the mechanisms involved 
in the coupling, of antibodies to solid supports. In the 
present context, these should display a capacity to 
adsorb (in the form of a monolayer)-** to covalently 
link-^a high surface density of antibody combined with 
low intrinsic-signal-generating properties (e.g., low in- 
trinsic fluorescence), thus miniminng background. We 
have examined a number of candidate materials, such 
as polypropylene, Teflon*, cellulose and nitrocellulose 

membranes, microti ter plates (clear polystyrene plates; 
black, white, and clear polystyrene plates), glass slides 
and quarto optical fibers coated with 3-{amino propyl) 
triethoxy silane, etc, and eev ral alternative protocols 
for achieving high monolayer coating densities. These 



studies have exposed phenomena neither evident nor of 
importance when antibody binding to solid supports ia 
examined at a macroscopic level. Provisionally, we have 
used white Dynatech Microfiuor microti ter plates— 
formulated f r the detection of low fluorescence signals, 
and yielding high signal/noise ratios and high coating 
densities of functional antibodies (-5 x lo 4 IgG mole- 
culesVmVfor assay development, although such 
plates are not ideal. Indeed, deficiencies in the antibody- 
deposition methods used constitute the principal source 
of imprecision in assay results and the limitation in 
sensitivity that this implies. Clearly, this represents an 
area for further study and refinement of current coating 
techniques. 

Notwithstanding the limitations of present instru- 
mentation (which, among other things, does not permit 
the use of time-resolving techniques to distinguish two 
individual fluorescence signals either from each other or 
from background fluorescence) and the crudeneas of 
present methods for coupling antibodies onto small 
areas, we have verified the theoretical concepts outlined 
above by comparing the performance of several aasays 
when constructed in microspot format and when conven- 
tionally designed. Although unoptimized, ratiometric 
microspot assays have yielded sensitivity values closely 
approaching those of conventional optimized IRMA. As 
an example, the results of a ratiometric assay system for 
thyrotropin, with use of Texas Red- and FTTC-labeled 
antibodies, are shown in Figure 13. Bearing in mind the 
well-known limitations of these and other "convention- 
al" fluorophon when used as immunoassay reagent 
labels, such results are encouraging, although further 
work is clearly required to achieve the considerably 
greater sensitivity theoretically predicted with use of 
improved fluorophors, better anUbody-microspotting 
techniques, and purpose-built (time-resolving) instru- 
mentation. 

The finding that highly sensitive immunoassays can 
be performed with far smaller amounts of antibody than 
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are currently used conventionally permits in turn the 
construction of antibody xnicrospot arrays enabling, in 
principle, the simultaneous measurement of thousands 
of different substances in 1-mL samples. In collabora- 
tion with investigators at the Centre for Applied Micro- 
biological Research, Porton Down, U.K., we are pres- 
ently developing various techniques for the creation of 
such arrays. Indeed, similar technologies have recently 
been used for the parallel synthesis of several different 
polypeptides, these enabling 10 000-microspot arrays to 
be constructed on silica chips approximating 1 cm 2 (24). 
Although arrays of this capacity are unlikely.to ever be 
required for conventional diagnostic purposes, we can 
anticipate that the ability to simultaneously measure 
many substances in the same sample will have revolu- 
tionary consequences in medicine and other similar 
areas. In addition, such techniques may ultimately 
permit the individual analysis of the multiple isofonns 
of certain Tieterogeneous* analytes (e.g., the glycopro- 
tein hormones), such molecular heterogeneity currently 
presenting a mayor obstacle to the standardization and 
interpretation of many immunological measurements 
(25). Moreover, although these concepts have been illus- 
trated in an immunoassay context, they are clearly 
applicable to all "binding assays," including those rely- 
ing on the use of DNA probes,, hormone receptors, etc. 
For example, labeled lectins that are specific in their 
reactions with the sugar residues in the oligosaccharide 
chains of glycoprotein molecules may be used, together 
with specific antibodies, to impart additional "structural 
specificity* to sandwich assays (26, 27), possibly over- 
coming the limitations of antibodies per se in regard to 
differentiation of the glycosylate variants of the gly- 
coprotein hormones. 



Summary and Conclusion ,J 

Because of past confusion regarding the concepts of 
precision, sensitivity, accuracy, etc., several erroneous 
concepts have become incorporated within currently 
accepted rules of immunoassay design. In particular, 
much higher antibody concentrations are customarily 
used than are necessary to achieve very high assay 
sensitivity, provided that certain measurement strate- 
gies are adhered to. In this presentation, we have 
attempted to show that, in principle, the highest assay 
sensitivities are obtained by confining a small number 
of sensor antibody molecules onto a very small area in 
the form of a xnicrospot and measuring their occupancy 
by an analyte, by using very high-specific-activity "de- 
veloping" antibody probes, thereby maximizing the sig- 
nal/noise ratio in the determination of sensor antibody 
occupancy. This observation, which contradicts cur- 
rently accepted immunoassay design theory, in turn 
makes possible the measurement of an unlimited num- 
ber of different analytes on a chip of very small surface 
area through the use of, e.g., laser scanning techniques 
closely analogous to those used in compact disk tech- 
niques of sound recording. Extensive experimental stud- 
ies in this area, albeit conducted with relatively crude 
techniques and instrumentation not specifically de- 



signed for these purposes, and therefore not reported in 
detail here, have demonstrated the feasibility of the 
n^aturised antibody microspot approach and the va- 
Udity of the general concepts on which it is based. We 
are therefore confident that this represents the basis of 
a nert-generation technology that is likely to have a 
revolutionary impact on all fields involving the use of 
binding assays. 
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Corrections 

Vol 37, pp. 1447-8: In our desire for rapid publication 
important errors were introduced into the following 
Technical Brief. The corrected version is here repro- 
duced in its entirety, with our apologies to the authors. 

Rapid Detection of 171 7-1 G-> A Mutation in CFTR Gene 
by PCR-Mediated Site-Directed Mutagenesis, Laura 
Crzmonesi,' Manuela Seia,' Carmelina Magnani, 1 and 
Maurizio Ferrari 1 0 Istituto Scientifco H.S. Raffaele, 
Lab. Centrale, Milano; 2 Istituti Clin, di Perfeaonament© 
Lab. di PJcercbe Clin., Milano, Italy) ' 




Until now, among the non-AP508 mutations identified in 
the cystic fibrosis transmembrane conductance regulator 
(CFTR) gene by the Cystic Fibrosis (CF) Genetic Analysis 
Consortium, the ones most frequently seen in our popula- 
tion sample are the 1717-1G-»A mutation (13/144 or 9% of 
the CF chromosomes) and the G542X mutation (167190 or 
8.4% of the CF chromosomes), both revealed by dot-blot 
hybridization of the polymerase chain reaction (PCR) prod- 
uct with allele-specinc oligonucleotides (ASO) probes (J). 

In an attempt to simplify the analysis of the most 
frequent mutations in the CFTR gene, we converted radio- 
labeled ASO detection into restriction endonudease anal- 
ysis of the amplified product 

A PCR-mediated site-directed mutagenesis (2, 3) to de- 
tect the G542X mutation by generating a novel BstNl site 
in the wild-type sequence had already been suggested (4) 
To detect the 1717-1G-»A mutation, we designed the 
reverse primer (5 -CTCTGCAAACITGGAGAGGTC-3') to 
contain a single-base mismatch (T-*G), which could create 
a novel Avail restriction site [G J G(A/T)CC] in the am- 
plified wild-type (WT) allele but not in the CF mutant (M) 
allele: 



WT: WT 1717 

I- 



fig. 1. Detection of the 1717-lG-A mutation by PCR 
RwetiocB were carried out with 1 ^ et genomic DNA ki a total volume ol 100 
moo,, 0.1 g/l gstatm, 200 nmoVL each of the tour aeo*yrtbonudeatida 
CT), and 1 oo pmol of each of the primers. PCfl conditions were sstotawt"- 

£ i iTT LW"? ,nd elec WPho»Md on 3% agaroee-1% NuSieve gel 
tori h at SO V. Bands were made visible by stainta the eel wtrhetrSkJIi 
bromide. Lane 7: /feeUHJIgested pBKi22»tolZ*?< uT? 
r^ygoje. Lmk 3.CF patent homwygou, J^7-^A rnJSSf 
Lane 4: heteroiygote carrier tor the 1717-1Q-.A mutation mum ^- 



For the forward primer, we used the one made available 
by the CF Genetic Analysis Consortium to amplify exon 11 
of the CFTR gene: 5'-CAACTGTGGTTAAAGCAAT- 
AGTGT-3'. 

Digestion by A veto enzyme of the PCR product generates 
two fragments of 116- and 21 -bp in the wild-type alleles 
and leaves undigested a 137-bp fragment in the mutant 
alleles (Figure 1). 

By combined analysis for the AF508 mutation (5) (252/ 
470 or 53.6* of the CF chromosomes), 1717-1G-»A, and 
G542X, about 71% of mutations might be detected by 
nonisotopic analysis of the PCR product, thus allowing a 
faster and easier one-day procedure for carrier screening 
and prenatal testing. 
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A method for determining the ambient concentrations 
of a plurality of analytes in a liquid sample of volume V 
liters, comprises 

loading a plurality of different binding agents, each 
bang capable of binding specifically and reversibly 
an analyte of interest onto a support means at a plural- 
ity of spaced apart locations such that not more than 
0.1 V/K moles of each binding agent are present at 
any location, where k liters/mole is the equilibrium 
constant of each such binding agent; 
contracting the loaded support means with the sample 
to be analyzed, such that each of the spaced apart 
locations is contacted in the same operation with the 
sample, the amount of sample liquid being such that 
only an insignificant proportion of any analyte pres- 
ent in the sample becomes bound to the binding agent 
specific for it, and 
measuring a parameter representative of the fractional 
occupancy by the analytes of the binding agents at 
the spaced apart locations by a competitive or non- 
competitive assay technique, using a labelled site- 
recognition reagent for each binding agent capable of 
recognizing either the unfilled binding sites or the 
filled binding sites on the binding agent, which ena- 
bles the amount of said reagent in the particular loca- 
tion to be measured. A device and kit for use in the 
method are also provided. 

17 Claims, 1 Drawing Sheet 
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, , _,_«.. which is a monoclonal antibody may, for example, have 

^ZZ^f^JPL 0 ** 1 ** 112 * 1 * equilibrium constant (K) which is of Aeri?of 

CONCENTATION OF SEVERAL ANALYTES 10" liters/mole for the spedfiT^tigea to wSch ft 

— . . . . . _ binds. Thus, under the above generally accented ones- 

5*8 PCT/GB88/0O649, Aug. 5, 0 f 10-" mole/liter or more is required for binding 

agents of such an equilibrium constant and, with fluid 
FIELD OF THE INVENTION sample volumes of the order of 1 milliliter, the use of 

The present invention relates to the determination of I0 W or m0TC molc of bindin £ agent (or site) is conven- 
ambient analyte concentrations in liquids, for example Penally deemed necessary. Avogadro's number is about 
the determination of analytes such as hormones, prote- 6x 10 23 so that 10- 14 mole of binding site is equivalent 
ins and other naturally occurring or artificially present t0 morc tnan 10 9 molecules of binding agent even as- 
substances in biological liquids such as body fluids. suming that the binding agent possesses two binding 

BACKGROUND OF thp TMVEvnnv is pcr molecuk - For specific binding agents of the 

BACKGROUND OF THE INVENTION 15 very highest affinity K is less than 10" liters/mole so 

I have proposed in International Patent Application conventional practice requires more than 10 7 mole- 

WO84/01031 to measure the concentration of an ana- cules of binding agent, whereas binding agents with 

lyte in a fluid by contacting the fluid with a trace lower affinity of the order of 10* liters/mole necessitate 

amount of a binding agent such as an antibody specific the use of more than 10 1 * molecules under conventional 

for the analyte in the sense that it reversibly binds the practice. In fact all immunoassay kits ™ Q rV^ com- 

analyte but not other components of the fluid, determin- mercially at the present time conform to these concepts 

ing a quantity representative of the proportional occu- and use an amount of binding site approximating to or 

pancy of binding sites on the binding agent and estimat- more frequently, considerably in excess ofV/K: indeed 

ing from that quantity the analyte concentration. In that m certain types of kit relying on the use of labeUed 

apphca^ antibodies it is conventional to use as much bind£ 

binding agent is sufficient* ow that its introduction agent as possible, binding proportions of analyte «£ 

into the fluid causes no significant diminution of the exceedm* 50% anaiyic grcauy 

f^^Jl^ 1 ^^ ""^ ^ Becaui of the binding of substantial proportions, for 
of binding agent, i,e. independent ^aS^S^ 

error usually associated with the measurement of frac- V ° 1UmC ° f flmd 50 for accura * 

tional occupancy. In such circumstances, and in these \ uaDtl J atlve « 15 necessary to control accurately 

circumstances only, the initial concentration [H] of 35 ! v °lumeof the sample, keeping it constant in all tests, 
analyte in the fluid is related to the fraction (Ab/Abl) of ^ hcthcr j of J thc """P 1 * of unknown concentration or of 
binding sites on the binding agent occupied by the ana- standard samples of known concentration used to 

lyte by the equation: generate the dose response curve. Furthermore, such 

systems also require careful control of the amount of 
Ab/Ab 0 -K*AH\A +K*lS\ 4, binding agent present in the standard and control incu- 

where (hereinafter referred to as K) is the equilib- ba ? 0D These limitations of present techniques are 
rium constant for the binding of the analyte to the bind- universally recognised and accepted, 
ing sites and is a constant for a given analyte and bind- ** alent Application 2,099,578A discloses a device 

ing agent at any one temperature. This constant is gen- for immunoassays comprising a porous solid support to 
erally known as the affinity constant, especially when 45 wnicn antigens, or less frequently immunoglobulins, are 
the binding agent is an antibody, for example a mono- 1)01111(3 at a plurality of spaced apart locations, said de- 
clonal antibody. vice permitting a large number of qualitative or quanti- 

The concept of using only a trace amount of binding uti ve "nmunoassays to be performed on the same sup- 
agent is contrary to generally recommended practice in P 01 ** *° r cxa ™plc to establish an antibody profile of a 
the field of immunoassay and immunometric tech- 50 sample of human blood serum. However, although the 
niques. For example, in such a well-known work as individual locations may be in the form of so-called 
"Methods in Investigative and Diagnostic Endocrinol- microdots produced by supplying droplets of antigen- 
ogy", ed. S. A. Berson and R. S. Yalow, 1973 at pages containing solutions or suspensions, the number of 
111-116, it is proposed that in the performance of a moles of antigen present at each location is apparently 
competitive immunoassay ™«"™"m sensitivity of the 35 still envisaged as being enough to bind essentially all of 
assay is achieved if the proportion of the "tracer" ana- ^ analyte (e.g. antibody) whose concentration is to be 
lyte that is bound approximates to 50%. In order to measured that is present in the liquid sample under test, 
achieve such a high degree of binding of the analyte the This is apparent from the fact that the quantitative 
theory of Berson and Yalow, to this day generally ac- method used in that application (page 3, lines 21-28) 
cepted by other workers in the field, requires that the 60 involves calibration with known amounts of immuno- 
concentration of binding agent (or, strictly speaking, of globulin being applied to the support; but this means 
binding sites, each molecule of binding agent conven- that, in the samples being tested, essentially every mole- 
tionally having one or at most two binding sites) must cule must be extracted from the sample in order for a 
be greater than or equal to the reciprocal of the equilib- true comparison to be made and hence that large 
rium constant (K) of the binding agent for the analyte, 65 amounts of antigen (i.e. the binding agent in this situa- 
i-e. [ab]> 1/K. For a sample of volume V the total tion) are required in each microdot, greatly in excess of 
amount f binding agent (or binding sites) must there- the total amount of analyte (i.e. antibody in this situa- 
fore be greater than or equal to V/K. A binding agent tion) present in the sample. 
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SUMMARY OF THE INVENTION 
The present inventi n involves the realisation that the 
use of high quantities of binding agent is neither neces- 
sary for good sensitivity in immunoassays nor is it gen- 
erally desirable. If, instead of being kept as large as 
possible, the amount of binding agent is reduced so that 
only an insignificant proportion of the anaiyte is revers- 
ibly bound to it, generally less than 10%, usually less 
than 5% and for optimum results only I or 2% or less, 
not only is it no longer necessary to use an accurately 
controlled, constant volume for all the liquid samples 
(standard solutions and unknown samples) in a given 
assay, but it is also possible to obtain reliable and some- 
times even improved estimates of anaiyte concentration 
using much less than V/K moles of binding agent bind- 
ing sites, say not more than 0.1 V/K and preferably less 
than 0.01 V/K. For a binding agent having an equilib- 
rium constant (K) for the anaiyte of the order of 10 11 
liters/mole and samples of approximately 1 ml size this 
is approximately equivalent to not more than 10 s , pref- 
erably less than 10 7 , molecules of binding agent at each 
location in an individual array. If the value of K is 10 13 
liters/mole the figures are 10* and 10 5 molecules respec- 
tively, and if K is of the order of 10 s liters/mole they are 
10" and 10 10 molecules respectively. Below ^mole- 
cules of binding agent at a single location the accuracy 
of the measurement would become progressively less as 
the fractional occupancy of the binding agent sites by 
the anaiyte would be able to change only in discrete 
steps as individual sites become occupied or unoccu- 
pied, but in principle at least the use of as low as 10 
molecules would be pennissible if an estimate with an 
accuracy of 10% is acceptable. Practical considerations 
may give rise to a preference for more than 10* mole- 
cules. 

It will be appreciated that the abovementioned GB 
patent application 2,099,378A, which for quantitative 
estimation relies on large amounts of binding agent and 
essentially total sequestration of all anaiyte, fails to 40 
recognise the advance achieved by the present inven- 
tion, which instead relies on a different analytical prin- 
ciple requiring measurement of the fractional occu- 
pancy of the binding agent and which thus requires only 
a very low proportion of the total anaiyte molecules 45 
present to be sequestered from the sample. 

Following the recognition that the use of such small 
amounts of binding agent is permissible, it becomes 
feasible to place the binding agent required for a single 
concentration measurement on a very small area of a 50 
solid support and hence to place in juxtaposition to one 
another but at spatially separate points on a single solid 
support a wide variety of different binding agents spe- 
cific for different analytes which are or may be present 
simultaneously in a liquid to be analysed. Simultaneous 
exposure of each of the separate points to the liquid to 
be analysed will cause each binding agent spot to take 
up the anaiyte for which it is specific to an extent (Le. 
fractional binding site occupancy) representati ve of the 
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a «*i«t- ; ' X v V r ™7 , ^ w " 1C compareo to ine otfter components of the liquid sample, 

anaiyte concentration in the liquid, provided only that 60 each location having not more than 0.1 V/K. prefeablv 
tne volume of solution and the analvte conc^ntratinn 



labelled with markers enabling the concentration levels 
of the separate reagents bound to the different binding 
agents to be measured, for example fluorescent markers. 
Such measurements may be performed consecutively, 
for example using a laser which scans across the sup- 
port, or simultaneously, for example using a photo- 
graphic plate, depending on the nature of the labels. 
Other imaging devices such as a television camera can 
also be used where appropriate. Because the binding 
agents are spatially separate from one another it is possi- 
ble to use only a small number of different marker labels 
or even the same marker label throughout and to scan 
each binding agent location separately to determine the 
presence and concentration of the label. By use of the 
invention considerably more than 3 analyses can be 
performed with a single exposure of the solid support 
with liquid to be analysed, for example 10, 20, 30, 50 or 
even up to 100 or several hundreds of analyses. 

Overall, therefore, the present invention provides a 
method for determining the ambient concentrations of a 
plurality of analytes in a liquid sample of volume V 
liters, comprising: 
loading a plurality of different binding agents, each 
being capable of reversibly binding an anaiyte 
which is or may be present in the liquid and is 
specific for that anaiyte as compared to the other 
components of the liquid sample, onto a support 
means at a plurality of spaced apart locations such 
that each location has not more than 0.1 V/K 
moles of a single binding agent, where K liters/- 
mole is the equilibrium constant of the binding 
agent for the anaiyte, 
contacting the loaded support means with the liquid 
sample to be analysed such that each of the spaced 
apart locations is contacted in the same operation 
with the liquid sample, the amount of liquid used in 
the sample being such that only an insignificant 
proportion of any anaiyte present in the liquid 
sample becomes bound to the binding agent spe- 
cific for it, and 
measuring a parameter representative of the frac- 
tional occupancy by the analytes of the binding 
agents at the spaced apart locations by a competi- 
tive or non-competitive assay technique using a 
site-recognition reagent for each binding agent 
capable of recognising either the unfilled binding 
sites or the filled binding sites on the binding agent, 
said site-recognition reagent being labelled with a 
marker enabling the amount of said reagent in the 
particular location to be measured. 
The invention also provides a device for use in deter- 
mining the ambient concentrations of a plurality of 
analytes in a liquid sample of volume V liters, compris- 
ing a solid support means having located thereon at a 
plurality of spaced apart locations a plurality of differ- 
ent binding agents, each binding agent being capable of 
reversibly binding an anaiyte which is or may be pres- 
ent in the liquid sample and is specific for that anaiyte as 
compared to the other components of the liquid sample, 



the volume of solution and the anaiyte concentration 
therein are large enough that only an insignificant frac- 
tion (generally less than 10%, usually less than 5%) f 
the anaiyte is bound to the point The fractional binding 
site occupancy for each binding agent can then be deter- 
mined using separate site-recognition reagents which 
recognise either the unfilled binding sites or filled bind- 
ing sites of the different binding agents and which are 
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less than 0.01 V/K, moles of a single binding agent, 
where K liters/mole is the equilibrium constant of that 
binding agent for reaction with the anaiyte to which it 
is specific. 

A kit for use in the method according to the invention 
comprises a device according to the invention, a plural- 
ity of standard samples containing known concentra- 
tions of the analytes whose concentrations in the liquid 
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sample arc to be measured and a set of labelled site- means to be such that liquid samples f approximately 

rec gnition reagents for reaction with filled r unfilled the volume V liters are readily retained-in contact with 

binding sites on the binding agents. the plurality of spaced apart locations marked with the 

In arriving at the method of the invention, I have different binding agents, For example, the spaced apart 

found that, generally speaking, for antibodies having an 5 locations may be arranged in a well in the support 

affinity constant K liters/mole for an antigen, the rela- means, and a plurality of wells, each provided with the 

tionship between the antibody concentration and the same group of different binding agents in spaced apart 

fractional occupancy of the binding sites at any particu- locations, can be linked together to form a microliter 

lar antigen concentration and the relationship between plate for use with a plurality of samples 

the antibody concentration and the percentage of anti- 10 When the support means is to be used in conjunction 

gen bound to the binding sites at any particular antigen with a measuring system involving light scanning, the 

concentration follow the same curves provided that the material, eg. plastics, for the support is desirably 

antibody concentrations and the antigen concentrations opaque to light, for example it may be filled with an 

are each expressed in terms of fractions or multiples of opacifying material which may inter alia be white or 

1/K - 15 black, such as carbon black, when the signals to be 

BRIEF DESCRIPTION OF THE DRAWINGS measured from the binding agent or the site-recognition 

_ . . , J , . t , , reagent are light signals, as from fluorescent or lumines- 

Tbe principle underlying the method of the invention cent markers. In general, reflective materials are pre- 

may be better understood by reference to the accompa- ferred in this case to enhance light collection in the 

nying drawing which is a graph representing two sets of 20 detecting instrument or photographic plate. The final 

curves plotting the relationship between antibody con- choice of optimum material is governed by its ability to 

centration and the fractional occupancy of the binding attach the binding agent to its surface, its absence of 

sites at certain prescribed antigen concentrations and background signal emission and its possession of other 

the relationship between antibody concentration and properties tending to nuuimise the signal/noise ratio for 
the percentage ofanugen bound to the binding sites at 25 the particular marker or markers attached to the bind- 

the same prescribed antigen concentrations. Each curve ing agent situated on its surface. Very satisfactory re- 

relates to the antibody concentration [Ab], expressed in suits have been obtained in the Examples described 

terms of 1/K, plotted along the x-axis. For the set of below by the use of a white opaque polystyrene microli- 

curves which remain constant or decline with increas- ter plate commercially available from Dynatech under 
ing [Ab], the y-axis represents the fractional occupancy 30 the trade name White Microfluor microliter wells 

(F) of binding sites on the antibody by the antigen; for The binding agents used may be binding agents of 

the second set, the y-axis represents the percentage different specificity, that is to say agents which are 

(b%) of antigen bound to those Binding sites. The indi- specific to different analytes, or two or more of them 

vidual curves in each set represent the relationships may be binding agents of the same specificity but of 
corresponding to four different antigen concentrations 35 different affinity, that is to say agents which are specific 

[Ann] expressed w terms of K, namely 10/K, 1.0/K, to the same analyte but have different equiHbriumcon. 

O.l/K and 0.01/K. The curve show that as [Ab] falls F sums K for reaction with it. The latter alternative is 

reaches an essenUally constant level, the value of which particularly useful where the concentration of analyte 

is dependent on [An]. to be assayed in the unknown sample can vary over 

DETAILED DESCRIPTION 40 considcrablc rai3 ^ cs ' for example 2 or 3 orders of magni- 

^ ^ . tude, as in the case of HCG measurement in urine of 

The choice of a solid support is a matter to be left to pregnant women, where it can vary from 0.1 to 100 or 

the user. Preferably the support is non-porous so that more IU/ml. 

the binding agent is disposed on its surface, for example The binding agents used will preferably be antibod- 
as a monolayer. Use of a porous support may cause the 45 ies, more preferably monoclonal antibodies. Mono- 
binding agent, depending on its molecular size, to be clonal antibodies to a wide variety of ingredients of 
earned down into the pores of the support where its biological fluids are commercially available or may be 
exposure to the analyte whose concentration is to be made by known techniques. The antibodies used may 
determined may likewise be affected by the geometry of display conventional affinity constants, for example 
the pores, so that a false reading may be obtained. Po- 50 from 108 or ^liters/mole upwards, e.g. of the order of 
rous supports such as nitrocellulose paper dotted with 10"> or 10" liters/mole, but high affinity antibodies with 
spots of binding agent are therefore less preferred, Un- affinity constants of 10«-10» liters/mole can also be 
like the supports used in GB 2,099,578A, which seem to used. The invention can be used with such binding 
need to be porous because of the large number of mole- agents which are not themselves labelled. However, it is 
cules to be attached, the supports for use in the present 55 also possible and frequently desirable to use labelled 
invention use much smaller quantities and therefore binding agents so that the system binding agent- 
need not be porous. The non-porous supports may, for /analyxe/site-recognition reagent includes two different 
example be of plastics material or glass, and any conve- labels of the same type, e.g. fluorescent, chemiluxnines- 
ment rigid plastics material may be used, Polystyrene is cent, enzyme or radioisotopic, one on the binding agent 
a preferred plastics material, although other polyolefins 60 and one on the site-recognition reagent. The measuring 
or acrylic or vinyl polymers could likewise be used. operation then measures the ratio of the intensity of the 
The support means may comprise microbeads, e.g. of two signals and thus eliminates the need to place the 
such a plastics material, which can be coated with uni- same amount of labelled binding agent on the support 
f nn layers f binding agent and retained in specified when measuring signals from standard samples for cali- 
locations, eg. hollows, on a support plate, Alternatively 65 oration purposes as when measuring signals from the 
the material may be in the form f a sheet or plate which unknown samples. Because the system depends solely 
is spotted with an array of dots of binding agent. It can on measurement of a ratio representative of binding site 
be advantageous for the configuration of the support occupancy, there is also no need to measure the signal 
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support with a solution of the bindfal^KXS £*?r V - . dther ae "^y* 

of .'small drop, e.g. 0.5 .-cJK.TS&'S; S*TT245 d ^° r f "** 
allowing them to remain in contact for a oeriod ha Kim* ,n T, t0 v .f?™? molecule -«-8- * protein 

before hashing the drops awaTA rouS coL^ ^° Wh,Ch * d ™ c,ly or The 

sman fraction If the bindL ajent preSin toe d™ s, ": re ~« mB «» rea *«« "V be labeUed directly or 

becomes adsorbed onto Uufsu^on afT resume Z fi^^"?"*? *T~* laWs SUch " 

procedure. It is to be noted that the coatinTde^ry of f^^J^^V^ Red m aUciiais ™ ble 

binding agent on the microspot does SSL tSL is ? ?.? ^ " UOreSCMce such * eur °Pi"» 

^ the coating density ? convenl^itibodT £ £ S^SS^ — 

coated tubes; the reduction in the number of molecutes ^i n T«T,^TL? chemiluminescent, enzyme or 

on each spot may be achieved soldyTreductfon of£ ^°™ P v ^ te T 1 if "H**"** E«* 

size of the spot rather than the coating SwAhS T^ET^JS*" f P^ 3 ** labeU «» with the 

coating density is generally desSw m« 20 ~Z labdS be used in different 

nal/noLe ratios. Tie sizes of Z wtt^^Li" T* "»S«"s n,ay be specific 

geously less than 10 mm* preferably^ toanTmS" 5** ^ V* ° f **. bindm * ^^analyu spots in 

The separation is desirably" but ™u necLarify 5? S™up of spots or m certain circumstances, as with 

times thTradius of the sp£ or more ttoopntom hormones such as HCG and FSH which 

geometries can neverthdea be c^griTreS 2< ^ / n ^ ^ My ^ ^" re ^8 

being subject solely » the limitations 5, thfnSlf " ^ * 

bmdmg agent molecules in each spot, the minimum the assav whni™ a. ■ . 

Once the binding agents have been coated onto th. ^ fr0ln ? tandard apples containing known con- 
support it is conve^ticLl prance to wato toe sunnon Cen r tlons of ?»» e Such standard samples 
in toe case of antibodies JbS££££ SaTES eS o^ 0 "^ ^ """^ t0gether ' P rovided 
tion containing albumen or other pVofem to ^ttrate Si 35 SL£ F ^ ^ " P"*"" " S ° me ° f standard 
remaining non-specific adsorption siteT on t^ ^m^t P Fracu onaI occupancy may be measured by 
and elsewhere. Toconfirm Khe So™ t of toC ""TJ^* -ST" 1 bindmg ^ ™ 
agent in an individual spot wffl ITtaSL toSS ^^.f ° r ""occupied binding sites (as with 
mum amount (0.1 V/K^e^rS to coS™ to toe' ^l d ' ot yP lc "tibody). as one is toe converse of the 
principle of the present invention^ SSrfhinT <n ? /' ^ ? aCCUrBCy * * desirable to measure 
mg ag«t present on any uSSoS^SS^ f ?°T ^ ^ closer to zero because a change in 
by labelling the bmdmg agent S a det^uWe n^S Z'n ^T"^ °/ °f " P ro P orton «dy greater 
ofkao^^^^^SSSSSS ~ ^ , 7 5 ? th ^ h f< ? r fraCti0nal ^"P^ies » the 
per unit weight of binding agent) and m«surmg^£ ™* M * satisf *- 

method of the invention toe bSS^S^£Z£ ft? °" ^ P UOr f < ? nt markers - *e measurement of 

less be labelled in a trial «peSffieS3SST SST.-T^ ° f (T ° m ,WO markere - 

tions to those found in tnaurial to rive rS m?coS buidln « a 8 M t «> d the other on toe site recog- 

loadings of binding agent caTbe uHnnW ^„lf <n r° T*" 1 ' m8y 0ut a laser 

belled binding agoft k fthe s^sTbelcSu^" SS^'r^ 

The ntinimum size of the liquid samole (VI h e «ft ?k ' *-" 1 ? b ^ fr0m Bl °" Rad Laboratories Ltd., 
correlated with toe number of moHf bindL^™ ^T* ' < ChaDBtl detection sys,em - ^ 
(less than 0.1 V/K) so tlatTnly « huiSnificsu? ™ feheS on 8 beam to scan the dots or the like 
tion of the analyte present mlwSr^ « ? Tr," CaUSe fluo "*«»« of toe markers and 
bound to toe bmding agen^L Thk pro^n bfaTI ^"l* to distinguish and measure the 
general rule less tow 10%, usSy teftoa^ 5% ™d " ° f f ' uo r escence ^"ed. Time-resolved fluo- 
desirably 1 or 2% or lesTde£2g on toTacTurS SS^SLT 5 ' ^° ^ Tf" ln,crference <^ 
desired for toe assay (greater accuracy beinc S . Tf^ betw ! en 1,16 ^ channcls «« be corn- 
other thing, being eq£ wtoSto P^JorSlf 60 Z^^JZ XT^t '"T^™ " U ^ or 
analyte are bound) and the magnitude of otoe?!™r conventional efforts can be made to reduce it. Discrimi- 

one or a few ml or less, e.g. down telMnuS^ ^^ dkd ^ t accom P lls bed in toe present form 

less, are often preferred, IJfS^S^^^ VSJ£?"T' ** of distinguishing 

when larger volumes are more co^ie^ly^L^L 65 ^ ™ ydtn # h of ^ two fluorescent 

and the geometry may be adjusted aSLirS fluores «f substances may be db- 

sample maybe used at its natural conceSon evelor St „ y P 5~™ 1 cWaclerisli « "ch as 

if desired it may be diluted to a taowTenSt «f ! ,luor f e ?f ence dcca y times, bleaching times, 

■ "own extent etc ., and any ofthese means may be used, either alone Or 
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in combination, t differentiate between two fluoro- micromiiK/ml » f r*~* A « • 
ph res and hence permit measurement of the rati f tTifE? U ^^ter aliquot! of 

two fluorescent labelled ^^ngu^t J S^HJT f " <> f one to 

site-recogniti n reagent) present on !SvS ^ ntat ^ 

ent the same techniques rLy be used^dl^t^ Sr^^t * <* 

is taken to scan the entire spot in each case and the spot! ™w ? 1111,6 ^ of ^ 

contain essentially the saie amoumTbkdingS Sa fSfxt^ ^ b ~ e ad$0lbcd ° at0 ^ 
from one assay to the next when the unknown aS 10 !1 ^ ^ WC ?" e with a 

standard samples are used. ™ 10 Phosphate buffer and then they are filled with about 400 

In the case of other labels, such as radioisotope la. P ucrobtersofa 1 % albumen solution and left for several 
bels, chentf uminescem labels or enzyme labels analo- 5?" *? "T* rcsiduaJ "> the wells, 

gous means of distinguishing the individual signal from buffer WaShed again Phosphate 

one or from each of a pair of such labels are also well 15 t£ ,^ , . 

known. For example two radioisotopes such as l2 *l and resulting plate has in each of its wells two spots 

»«I may be readily distinguished on the basis of the T approximately 1 mm*. Measurement of the 

differing energies of their respective radioactive emis- 0 ™ uor f"*: e ^ in each well one spot 

sions. Likewise it is possible to identify the products of ^ * v X . molcculcs <* anti-TNF antibody 
two enzyme reactions, deriving from dual enzyme- 20 unr ^ ab ° Ut 5 X 109 ™>lecules of anti- 

labelled antibody couplets, these being e.g. of different J 1 *,^ « <Mgned for use with 

colours, or two chemiluminescent reactions, e.g. of v£ -Tf n ?/ V0 Ume 400 so that 0.1 

different chemfluminescent lifetime or wavelength of i ^ j 4 ** 0 moles (equivalent to 2.4xl0>° mole- 
light emission, by techniques well known in the resDec- , anti * TNF ^tibody and 7 X 10- M moles 
tive fields. ™ rCSpCC 23 (equivalent to 4X10»> molecules) for the anti-HCG 

The invention may be used for the assaying of ana- anUood y- 
lytes present in biological fluids, for example human rv AVfPT F - 

body fluids such as blood, serum, saliva or Urine. They A . U 2 

may be used for the assaying of a wide variety of hor- , * microliter plate prepared as described in Example 
mones, proteins, enzymes or other analytes which are 30 1 15 ^ m 2* ^ for artificially produced solution 
either present naturally in the liquid sample or may be comamill S TNF HCG. A test sample of the solu- 
present artificially such as drugs, poisons or the like &on, amounting to about 400 microliters, is added to one 
For example, the invention may be used to provide a ° A v~ ™ and flowed to incubate for several hours, 
device for quantitatively assaying a variety of hormones - microliters of various standard solutions 

I^ISJ? P"?™^ ««* reproduction, such as FSH, 35 ^"^gj^wn concentrations (0.02, 0.2, 2 and 20 
LH, HCG, prolactin and steroid hormones (eg proees - ? } ° f TNF or HCG are addcd "> other wells of the 
terone, estradiol, testosterone and androstene-dione) or tC m , d allowcd 10 incubate for several hours, 
hormones of the adrenal pituitary axis, such as Cortisol , S arc ***** washed several times with buffer 
A 9 11 *!* or thyroid-related hormones,' 

such as T4, T3, and TSH and their binding protein 40 "te-recogmuon reagents there are used for the 
TBG, or viruses such as hepatitis, AIDS or h\rpes vi- F Spots anti " TNF antibody having an affinity 

nis, or bacteria, such as staphylococci, streptococci. ™?^° T ™? at 25# C ofabout 1 * 1 Otters/mole 
pneumococci, gonococci and enterococci, or tumour. ^ G Spots 4,1 Uti -HCG antibody having an 

related peptides such as AFP or CEA, or drugs such as r / cons ? nt for HCG at 25' C. of about 1x10" 
those banned as illicit improvers of athletes' pcrfor- 45 • ~^ Both antibodies ar « labelled with lluores- 
mance, or food contaminants. In each case the bindinc i^nlr 400 mjcrolitcr aliquots of solutions of these 
agents used will be specific for the analytes to be as- ,abclled antibodies are added to the wells and allowed 
sayed (as compared with others in the sample) and may l ° 1^°^ for 2 fcw hours - ^ wells are then washed 
be monoclonal antibodies therefor. y with buffer. 

Further details on the methodology are to be found in 50 ^ resultin S fiu °r«cence ratio of each spot is quan- 
my International Patent Publication W088/01058 the • a Bio-Rad Lasersharp MRC 500 confocal 

contents of which are incorporated herein by reference mcrosc f>I*. From the standard solutions dose response 

The invention is illustrated by the following Exam- ££2^ ™f ^ d HCG arc built U P» figures for 
pies. 6 W being as follows: 

EXAMPLE 1 35 



* t-ktc / TNF concentration __nTCfhjorescence 

An anti-TNF (tumour necrosis factor) antibody ha v- ag/wl "ie»n Red fiuorocence" on ""^ «P°« 

mg an afTmity constant for TNF at 25' C. of about " 



0.02 u 



I X 10 s liters/mole is labeUed with Texas Red. A solu- 02 4 6 

tion of the antibody at a concentration of 80 micro- 60 J! M 

grams/ml is formed and 0.5 microliter aliquots of this - 1H 

solution are added in the form of droplets one to each 

well of a Dynatech Microfluor (opaque white) filled ^ ***** for HCG being as follows: 
polystyrene microliter plate having 12 wells. 
An anti-HCG (human chorionic gonadotropin) ami- 65 hcg 



body having an affinity constant for HCG at 25* C f cooccnuition fitc nuor«^n^ 

about 6X10* liters/mole is also Sb5£? S£ ~ J - Xedfl — °° HCG>POt 

Red. A solution of the antibody at a concentration of 80 1 J 

7J 
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"Continued in the urine of women in pregnancy testing, giving good 

HCG ooocesumtkm FTTC noorescepcc correlations with results obtained by other means and 

lottRednuoTBceace Q« HCG tpot achieving effective concentration measurements fox 

2 lto HCG over a concentration range of tw or three rders 

— 2 . H£ 5 of magnitude by correct choice of the best HCG spot 

and dose response curve. 
The artificially produced solution was found to give Production of labelled antibodies 
ratio readings of 5.9 on the TNF spot and 10.5 on the The labelling of the antibodies with fluorescent labels 
HCG spot, correlating well with the actual concentra- can be carried out by a well known and standard tech- 
lions of TNF (0.5 ng/ml) and HCG (0.5 ng/ml) ob- nique, see Leslie Hudson and Frank C Hay "Practical 
tamed from the dose response curves. Immunology", Blackwell Scientific Publications (1980), 

EXAMPLE 3 P a ^ tt 1 1-13. example as foUows: 

Using similar procedures to those outlined in Exam- SD ^ c Tri^^^^ FSH 3 °l ** FSH 
pie la microliter plate 15 S^lt^^!^^^^^ 

T4 (thyroxine) antibody (affinity constant about ^ ^Si 2 ^ ^ P rod ««d in 

IX 10" liters/mole at 25* C ), labelled anti-TSH (thy- £?. Mldd . lcsex Hospital Medical School and was la- 
roid stimulating hormone) antibody (affinity constant wth TRITC (rhodamine isothiocyanate) or 

about 5 X I0 9 liters/mole at 25* C) and labelled anti-T3 TcMS Red ' * a red fluor e**ncc 
(triiodothyronine) antibody (affinity constant about 20 Thc monoclonal antibody anti-FSH 8D10, a cross- 
1X10 11 liters/mole at 25* C) in each of the individual rcfl cting (alpha subunh) antibody having an affinity 
wells is produced, the spots containing less than constant (K) of lxl0 !I liters per mole, was likewise 
1X10-12 V moles of anti-T4 antibody or less than produced in the Middlesex Hospital Medical School 
2X10- 11 V moles of anti-TSH antibody or less than ™* was labelled with FITC (fluorescein isothiocya- 
1 X 10-" V moles of anti-T3 antibody. 25 nate), giving a yellow-green fluorescence, 

r d ~«° pmfi a ? tibod y (site-recognition reagent) The general procedure used involved ascites fluid 
for the TSH assay san anti-TSH antibody with an purification (ammonium sulphate precipitation and 

f ^rSl U ^5 2 ^i°l htCrS/m0lC al 25# T * cl chromatography) followed by labelLg, acco* 
G Tbs antibody is labelled with fluorescein (FITC). mg to the following steps: * 
The ^-recognition .reagents for the T4 and T3 assays 30 lA Ammonium sulphaTpurincation 
are T4 and T3 coupled to poly-lysinc and labelled with 1 a a a a i i h -punncaaon 
FITC and they recogrusTthe Willed siteTon S to V^ B J^ 

respective first antibodies. 0 5 ml ™ h *° 6y preparation (culture supernatant or 1 :5 

Using 400 microliter aliquots of standard solutions dUmC f aSCltCS fluid) UDder constanl Erring (45% satu : 
containing various known amounts of T4, T3 and TSH 35 raUon '' 

dose response curves are obtained by methods analo- 2 ' 001111x11,6 stir ™g for 30-90 min. Centrifuge at 2500 
gous to those in Example 2, correlating fluorescence rpm for 30 min - 

ratios with T4, T3 and TSH concentrations. The plate is 3 ' Discard supernatant and dissolve the prccipi- 
used to measure T4, T3 and TSH levels in serum from tote PBS volume 5 ml.). Repeat Steps 1 and 2, 
human patients with good correlation with the results 40 OR ~ 

obtained by other methods, 4. Add 3.6 ml saturated ammonium sulphate (40% 

EXAMPLE 4 saturation) under constant stirring. Repeat Step 2. 

5. Discard the supernatant and dissolve the pellet in 
Using similar procedures to those outlined in Exam- the desired buffer, 
pie la microliter plate ^ntaining spots of first labelled « 6. Dialyse overnight in cold against the same buffer 
ann-HCG antibody (affinity constant about 6X10* Ii- (using fresh, boiled-in^d/w dialysis bac) 
tera/mok at 25* CX second labelled anti-HCG ami- 7. Determine the protein concentration either at A™ 
body affinity constant about L3 X 10" liters/mole at or by Lowry estimation. DCCDtrauon CIther al A "0 

duced, the spots each containing tetC 0J v£ ^ Clear 2 ml of asates fluid by centrifugation at 4000 
moles of the respective antibody. A cross-reacting Tah^ ,vi T « i • ^ 
(alpha subunit) monoclonal antibody 8D10 with an Tns-Cl soluoon to achieve final concen- 

affinity constant of 1 X 10" liters/mole is used as a com- 55 Ua . u °? ° f a ™: 

mon developing antibody for both the HCG and the sufficient amount of solid potassium sulphate. 

FSH assays. Final conceutration:=0.5M. 

Using 400 microliter aliquots of standard solutions 4 * A PP lv ^ *scitt fluid to the T-gel column, 
containing various known concentrations of HCG and 5 - Wash tnc column with 0. 1 M Tris-Cl bufTer contain- 
FSH, dose response curves are obtained by methods 60 m £ °- 5M potassium sulphate, until protein profile (at 
analogous to those in Example 2, correlating fluores- A 28o) returns to zero. 

cence ratios with HCG and FSH concentrations, the 6 * Elute the absorbed protein using 0.1M Tris-Cl 
curve obtained with the higher affinity anti-HCG ami- buffer as the eluant. 

body giving more concentration-sensitive results at the 7. Pool the fractions containing antibody activity and 
lower HCG concentrations whereas the curve from the 65 concentrate using Amicon 30 concentrater 
lower affinity anti-HCG antibody is more concentra- 8. If HPHT purification is to be carried out, use 
tion-sensitive at the higher HCG concentrations. The HPHT chromatography Starting buffer during Step 7 
plate is used to measure HCG and FSH concentrations 2. Labelling of Antibodies nreTOTC^JK; 
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^SSSfBXlSSSlS ^^^^^ 

1am Fncm^rc to achieve . lfl0 rau0 ^ °?^ f MftK ^ Hi 

* *J™« *< conjugated protein f roni ^oonju- -MU «jf l^wS f^^tr"" 

/^'abd ch ~^ for TRITC '0 J-JjiJ. analytes whose ccSSS^^SSt 

"raSS"' h- ? A me,h0d daimed i» «hfa 1. wherein said 

r^,w iv' u binding agents are labelled with marked ouMm fte 

0.005M Phosphate, pH 8.0 and 0.18M Phosphate, „ concentration levels of said binding agents to be mea! 

pH 8.0 for (b). ** surcd. 

11. A method as claimed in claim 10, wherein said 

"iftimt- ^ ^ site-recognition reagents are 

UJUio--ajsxciD.49S» labelled with fluorescent markers such that at the indi! 

M SpOU Ae te chnique for measuring fac- 

I dam.: 20 Uona] occupancy of the binding agents measures X 

1. A method for determining the ambient concentra- of the signals emitted by the fluorescent markers, 

boas of a plurality of analytes in a liquid sample of " toT "« "> determining the ambient con- 

volume V liters, comprising: centra dons of a plurality of analytes in a liquid sample 

loading a plurality of different binding agents, each ™ ? *° . V If 16 ? 5 ' Sprang a solid support means 
being capable of reversibly binding an anaTy^ 25 lo f ,ed « hereo » « M*h coating density at a 

which is or may be present in the liquid sample and „Si « ^ ?5 Sp0ts ■ P lunJi «y of differ- 
is specific for said analyte as compared to the other ' h ^ img * 8eat * 

components of the liquid ample, onto a support STSh r "* Which " °' may be pres- 

meat* at a plurality of spaced apart smalTE 30 I T^X"* k 8pcdfic for ^ Ate 
such that each spot has a high costing denshTof LT~ch^ COmp ° nents « 

such that only an insignificant proportion of any «° 15. A kit for use in determining the ambient cnn ^ 

analyte present m said liquid sample becomes tration of a plurality of anah^^a SaJmSTSr 

bound to said binding agent specific for said ana- volume V liters, comprising: * P ° f 

lyte, and a solid support means having located thereon at hi*h 

measuring a parameter representative of the frac- coating density at a plurality of spac^a^sSS 

uonal occupancy by said analytes of said binding 45 spots ■ plurality of different binding agents, ^ch 

agents at the spots by a competitive or non-com- bmdin 8 agent being capable of reveribly bindme 

petrtive assay technique using a site-recognition ■» which is or may be present in said liquid 

reagent for each binding agent capable of recogniz- ^P 1 ' and " specific for said analyte as compared 

mg either the unfilled binding sites or the filled .„ to the other components of the liquid sample, rach 

binding sites on said binding agent, said ske-recog- spot not more than 0.1 V/K moles of a 

nraon reagent being labelled with a marker en- $ffl . gl . e bind5n 8 where K liters/mole is the 

abling the amount of said reagent in the particular constant of said single binding agent for 

location to be measured. reaction with the analyte to which it is specific- 

2. A method as claimed in claim 1, wherein each of « 8 pluraii,y of standard samples containing known 
said spots has a size of less than 1 mmi. 55 concentrations of the analytes whose concentra- 

3. A method as claimed in claim 2, wherein each of UOnS - . ,n * e U< ' uid ^P 1 ' are to be measured; and 
said spots contains more than 10« molecules of bindin* * ^-.V «rT ed "Recognition reagents for reaction 
agent 8 with Med or unfilled binding sites on said binding 

4. A method as claimed m claim 3. wherein each of «, , /TO , • 

said spots has less than 0.01 V/K moles of bindins A .^' ^ cl ^ ed "claim IS, wherein each of said 

agent * spots has a sue of less than 1 mm 2 . 

5. A method as claimed in claim 3, wherein said bind- s J« IS " ^ cUim J 6 ' whef ein each of said 
ing agents used have affinity constants for said anaJvtes m ° re lb ™ 104 molecules of binding 
of from 10« to 10" liters per mole. ^ 6j ** ent 
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ABSTRACT 



A method for determining the ambient concentrations of a 
plurality of analytes in a liquid sample of volume V liters, 
comprises 

loading a plurality of different binding agents, each being 
capable of reversibly binding an analyte which is or 
may be present in the liquid sample and is specific for 
that analyte as compared to the other components of the 
liquid sample, onto a support means at a plurality of 
spaced apart locations such that each location has not 
more than 0.1 V/K, preferably less than 0.01 V/K, 
moles of a single binding agent, where K liiers/mole is 
the equilibrium constant of the binding agent for the 
analyte; 

contacting the loaded support means with the liquid 
sample to be analyzed, such that each of the spaced 
apart locations is contacted in the same operation with 
the liquid sample, the amount of liquid used in the 
sample being such that only an insignificant proportion 
of any analyte present in the liquid sample becomes 
bound to the binding agent specific for it, and 

measuring a parameter representative of the fractional 
occupancy by the analytes of the binding agents at the 
spaced apart locations by a competitive or non- 
competitive assay technique using a site-recognition 
reagent for each binding agent capable of recognizing 
either the unfilled binding sites or the filled binding 
sites on the binding agent, said site -recognition reagent 
being labelled with a marker enabling the amount of 
said reagent in the particular location to be measured. 
A device and kit for use in the method are also 
provided. 
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DETERMINATION OF AMBIENT Vih^.m,! 2 

CONCENTRATIONS OF SEVERAL te^^^f*"* i8en ' »'«) ■« 

ANALVTES ^ ^ a ^ 

application Ser. No. 07/984.264 filed rw i T^ P «*uJT\ ant, « en 10 it binds. Thus. 

patent application Ser. No. 07/460,878, filed Feb. 2, 1990 ■ r V«d for bind U /^^^ B , ^ le F or,l,on! 
now abandoned, filed as PCT/GB88/00649, Aug. s. iff with fluidUple Ss^f orT oM 

FIELD OF THE INVENTION - 
? e present invent relates „ to delerninatioD rf S^^^f^^rS, 

other naturally occurring or anfficta£SsCZ« « S K i ** ^ ° f * e very b|S 

biological liquids such as body fl u idi P SU,D " S " ™ ^ \° 50 ,hat conCenLal 

Sh^W™ ffl0re,haD 10 mok ^«of binding .gent 
BACKGROUND OF THE INVENTION SwES'* ^ l0Wer af5ni 'y of ,be «*r of 

i l iir "icrs/molc necessitate the use of mnn> that, irvi2 _ , 

WO84/01031 to measure the concentration of an analvte in fa,s nlarkeled commercially at the present 1?! ^' 

bind! * T a T g ,bC fluid ^ a * a <* «"«»'' of a ^ ™ d use an Lount tf5^«£SL* 

binding agent such as an antibody specific for the analyte in **** 10 ° r - «*>" frequently, comKB S e S! r 

usually associated with the measurement of fractional occT eventration used to generate u£ *1 P lcsofknown 

binduig agent occupied by the analyte by the' equation: ,0 "cognised and acctpTed P teChmqUeS " 

KJm • UK Patenl APPli^tion 2.099.578A discloses a device for 

■XT "Sr mimunoassays comprising a porous solid support ,c which 

l ^ co ^«" is gcneraUy L>in J the «™m- However. although .he individual loauo^yb^ 

affimry constant, especially when the binding agent is an ¥ l0m ° f microdots produced toStiZ 

antibody, for example a monoclonal antibody 8 8 ° 50 dre I*« * antigen-containing solutions oTw-Lffifc 

The concept of using only, trace amount o binding agent nUmb " ° f moles of an 'ig« Present J^St 

is contrary l0 generally recommended practice in me leW 0 'T""* S,iJ1 as «0"gh ,0 bSSL 

immunoassay and immunometric techniques For example " ally aU of ,be ana| y ,e («=-6- antibody) wboL ln„„,ff 

SS^EETt." T ,h0dS m '-41^ d - «• b- ~«d tha, is pfesen, in SJSS 

SK^S^S^T'i!?- ^ ^ Be,SOn and R - S " » me'h^ ,S ^ a P par "' from *e f»« thai me quLS 
ralow, 1973 at pages 111-116, it is proposed thai in the Used ,D Ihal a PP"caUon (page 3 lines 21 2S 

performance of . competitive immunoassaymaximuj Tsen " ,V ° lvcs ca J«"a«ion with known amounts of inm«„™ ^ 

?3-l£; r, y v ct r e,i * ,be ^ = 1°*; ppIied ,o ,he ^ ^inrts 

tracer analyte tba is bound approximates to 50%. In order mp ' eS be,n 8 ,csled . essentially every molecule mu „ 

KZf ^ u 1 , W ' '° ' hlS day 8" era "y accepted T and hence ,hal lar 8e amounts of antigen (iV the 

of £5. ^ fie d ' rCqUireS ' ha ' lhe "ncentraLn "'"^ a 8 ent iD *« situation) are requS fa e .dh 

moSK i° r> SpeakiD8 ' of biDdiD 8 sites, each "'"^^■ly » excess of the total amount of "JS 

moi^ JwffM rr D,i0nalJy haViD8 ° De or a ' ( M y " ' hiS SitUa,i ° D) PreSeD ' iD ,be sample X 

« ^ SUMMARY OF THE INVENTION 

agentformean^.i.e.^l/K.ForaslmpleofvS «1&ZS£ZE^ 
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good sensitivity in immunoassays nor is it gcncraUy desir- 
able. If, instead of being kept as large as possible the 
amount f binding agent is reduced so that only an insig- 
nificant proportion of the analyte is reversibly bound to it, 
generaUy less than 10%, usually less tfaan 5% and for 
optimum results only 1 or 2% or less, not only is it no longer 
necessary to use an accurately controlled, constant volume 
for all the liquid samples (standard solutions and unknown 
samples) in a given assay, but it is also possible to obtain 
reliable and sometimes even improved estimates of analyte 
concentration using much less than V/K moles of binding 
agent binding sites, say not more than 0.1 V/K and prefer- 
ably less than 0.01 V/K. For a binding agent having an 
equilibrium constant (K) for the analyte of the order of 10" 
liters/mole and samples or approximately 1 ml size this is 
approximately equivalent to not more than 10 s , preferably 
less than 10 , molecules of binding agent al each location in 
an individual array. Iftoe value of K is 10" liters/mole the 
figures arc 10 andJO 5 molecules respectively, and if K is 
of the order of 10 s liters/mole they are 10 13 and 10 30 
molecules respectively. Below 10 2 molecules of binding 
agent at a single location the accuracy of the measurement 
would become progressively less as the fractional occu- 
pancy of the binding agent sites by the analyte would be able 
to change only in discrete steps as individual sites become 
occupied or unoccupied, but in principle at least the use of 
as low as 10 molecules would be permissible if an estimate 
with an accuracy of 10% is acceptable. Practical consider- 
ations may give rise to a preference for more than 10 4 
molecules. 

It wffl be appreciated that the abovementioned GB patent 
application 2,099,578A, which for quantitative estimation 
relies on large amounts of binding agent and essentially total 
sequestration of all analyte, fails to recognise the advance 
achieved by the present invention, which instead relies on a 
different analytical principle requiring measurement of the 
fractional occupancy of the binding agent and which thus 
requires only a very low proportion of the total analyte 
molecules present to be sequestered from the sample. 

Following the recognition that the use of such small 
amounts of binding agent is permissible, it becomes feasible 
to place the binding agent required for a single concentration 
measurement on a very small area of a solid support and 
hence to place in juxtaposition to one another but at spatially 
separate points on a single solid support a wide variety of 
different binding agents specific for different analytes which 
are or may be present simultaneously in a liquid to be 
analysed. Simultaneous exposure of each of the separate 
points to the liquid to be analysed will cause each biding 
agent spot to take up the analyte for which it is specific to 
an extent (i.e. fractional binding site occupancy) represen- 
tative of the analyte concentration in the liquid, provided 
only that the volume of solution and the analyte concentra- 
tion therein are large enough that only an insignificant 
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can also be used where appropriate Because the bindins 
agents are spatially separate from one another it is possible 
to use only a small number of different marker labeisTeven 
the same marker label throughout and to scan each bindinc 
agent location separately to determine the presence and 
concentration of the label By use of the invention consid- 
erably more than 3 analyses can be performed with a single 
exposure of the solid support with liquid to be analysed, for 
example 10, 20, 30, 50 or even up to 100 or several hundreds 
of analyses. 

* creforc ' present invention provides a 
method for determining the ambient concentrations of a 
plurality of analytes in a liquid sample of volume V biers 
comprising: 

loading a plurality of different binding agents, each being 
capable of reversibly binding an analyte which is or may be 
present m the liquid and is specific for that analyte as 
compared to the other components of the liquid sample, onto 
a support means at a plurality of spaced apart locations such 
that each location has not more than 0.1 V/K moles of a 
single binding agent, where K liters/mole is the equilibrium 
constant of the binding agent for the analyte, 

contacting the loaded support means with the liquid 
sample to be analysed such that each of the spaced apart 
locations is contacted in the same operation with the liquid 
sample the amount of liquid used in the sample being such 
that only an insignificant proportion of any analyte present 
m the hquid sample becomes bound to the binding agent 
specific for it, and 6 6 

measuring a parameter representative of the fractional 
occupancy by the analytes of the binding agents at the 
spaced apart locations by a competitive or non-competitive 
assay technique using a site -recognition reagent for each 
binding agent capable of recognising either the unfilled 
binding sites or the filled binding sites on the binding agent 
said siie-recogoition reagent being labelled with a marker 
enabling the amount of said reagent in the particular location 
to be measured. 

The invention also provides a device for use in determin- 
ing the ambient concentrations of a plurality of analytes in 
a hquid sample of volume V liters, comprising a sob'd 
support means having located thereon at a plurality of 
spaced apart locations a plurality of different binding agents, 
each binding agent being capable of revcrsibly bindinc an 
analyte which is or may be present in the liquid sample and 
is specific for that analyte as compared to the other compo- 
nents of the liquid sample, each location having not more 
than 0.1 V/K, preferably less than 0.01 V/K, moles of a 
single binding agent, where K liters/mole is the equilibrium 
constant of that binding agent for reaction with the analvte 
to which it is specific. 



A kit for use in the method according to the invention 
reaction with filli-H nr i. n fiiu,i k;«j: :._„ .. 6 . . 



using separate site-recognition reagents which recognise 
either the unfilled binding sites or filled binding sites of the 
different binding agents and which are labelled with markers 
enabling the concentration levels of the separate reagents 
bound to the different binding agents to be measured for 
example fluorescent markers. Such measurements may be 
performed conseoi lively, for example using a laser which 
scans across the support, or simultaneously, for example 
using a photographic plate, depending on the nature of the 
labels. Other imaging devices such as a television camera 
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-wgmiiyB rcaeenis lor 
reaction with filled or unfilled binding sites on the bindine 
agents. * 

In arriving at the method of the invention, I have found 
that, generally speaking, for antibodies having an affinity 
constant K liters/mole for an antigen, the relationship 
between the antibody concentration and the fractional occu- 
pancy of the binding sites al any particular antigen concen- 
tration and the relationship between the antibody concen- 
tration and the percentage of antigen bound to the binding 
sites at any particular antigen concentration follow the same 
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curves provided thai ibc antibody concentrations and the may inter alia be white or blade, sucb as carbon black when 
antigen concentrations are each expressed in terms of frac the signals to be measured from tbe binding agemor to 
uons r multiples of site -recognition reagent arc light signals, as from fluorescent 

BRIEF DESCRIPTION OF TOP na awi wr ° l !r UmiD f " n L ma *«*- In general, reflective materials are 

BKIfcf UtoLKirJlON OF THE DRAWING 5 preferred in this case to enhance light collection in the 

Tbe principle underlying the method of the invention ma v d * ,ecl «« instrument or photographic plate. The final choice 
be better understood by reference to tbe accompanying ^ optimum material is governed by its ability to attach the 
drawing which is a graph representing two sets of curves mg agCDt l .° lts surfacc ' ils absence of background signal 

plotting the relationship between antibody concentration and emissl0D possession of other properties lending to 

the fractional occupancy of the binding sites at certain 10 maximise ^ »gD»l/boise ratio for the particular marker or 
prescribed antigen concentrations and the relationship markers attached to tbe binding agent situated on its surface, 
between antibody concentration and the percentage of ami- y f 311 ^ 3010 ^ rcsults havc been obtained in the Examples 

gen bound to tbe binding sites at the same prescribed antigen dcscnbcd below by the use of a white opaque polystyrene 
concentrations. Each curve relates to tbe antibody concen- microlllre P^te commercially available from Dvnatech 
tration [Ab], expressed in terms of 1/K, plotted along the 15 UDdcr lrade Dame Whi,e Mic ">fluor microtitre wells, 
x-axis. For the set of curves which remain constant or Tbc binding agents used may be binding agents of dif- 
decline with increasing [Ab], tbe y-axis represents the fcrcDl s P ecifici| y» lhal * 10 say agents which are specific to 
fractional occupancy (F) of binding sites on tbe antibody by diff ereot analyies, or two or more of them may be binding 
the antigen; for the second set, the y-axis represents the agents of tbe same specificity but of different affinity, that is 
percentage (be) of antigen bound to those binding sites. The 20 l0 . My a 8 cnts wnich arc specific to the same analyte but have 
individual curves in each set represent the relationships diffcrc nt equilibrium constants K for reaction with It. The 
corresponding to four different antigen concentrations [An] laller alternative is particularly useful where the concentra- 
expressed in terms of K, namely 10/K, 1.0/K, 0.1/K and ll0D of ana,vlc l ° be assayed in the unknown sample can 
0.01/K. Tbe curves show that as [Ab] falls F reaches an vary ovcr C0Dsiderab le ranges, for example 2 or 3 orders of 
essentially constant level, tbe value of which is dependent on 25 ma & niludc » 35 in ^e case of HCG measurement in urine of 
[An]. pregnant women, where it can vary from 0.1 to 100 or more 

IU/ml. 

DETAILED DESCRIPTION Tbe binding agents used will preferably be antibodies, 

The choice of a solid support is a matter to be left to the ^ mOD °clonaJ antibodies. Monoclonal ami-' 

iiserPreferablymesup^ 30 vanety of mgred^nts of biological fluids 

agentisdisp^onitssurfacxJorexampleasamonoW " h f?™ * "*™ y * made by kaa9m 

Use of a porous support may cause the bindine «em * 9 antibodies used may display conventional 

dependmg£itsmole P ^ ^r™^'^^ " ? ^ }il€M 

pores of tbe support where its exposure to the analyte whose ^.^^i 0 °' 10 blere/molc ' bul 

concentration £ to be determiircdmay likewis^ bfaffeS 35 r!* fT* ^Tt* °° DSlaDls of 1022 - 10 " 

by tbe geometry of tbe pores, so that a false readine may be u f °5- C, ° ** "f d ; ™e invention can be used with 

obuined. Porous suppoSs such as nitrocellu ose pfpTr do^ ^ m 001 lbcrosclvc * 

ted with spots of binding agent are therefore Tj£( uZ^V^' ^ t Dd f " qUCDlly dcsirablc 10 usc 

Unlike the supports used in GB 2,099,578A, which seem to > , 50 ^ lbc SySlcm biDdin S •*«*/ 

need to be porous because of tbe large number of S 4 ° T ^^^on reagent includes two different labels 

to be attached, the supports for use in tbe present invention same type, e g. fluorescent, chemiluminescent, 

use much smaller q^anthies and thereto* okT^ eDZ *™ °* radl01SOIO P lc ' 0DC on < hc Ending agent and one 

porous. Tbe non-porous supports may, for example be of * '"gent. Tbe measuring operation 

pasties material oYglass, andanv convenient ririd plasiics J" "T"? " ^ ° f ° f ** lwo signak and 

material may be usL. Wystyrene is a pre3 J as ," 45 £l^ T< am ° UDl ° f 

material, although other polyolefins or acrylic or vinvl i r g J*? 1 00 /^PP™ whc ° measuring 

polymers could likewise bTused. V Slga * h from slandard sam P lcs for calibration purposes as 

- when measuring signals from tbe unknown samnlec 

The support means may comprue m.crobcads. e.g. of Because the sysfcm depends solclv on me^urememol 

such a plastics miend. wb.ch can be coaled with uniform 50 ratio representative of binding site occupancT bere is ato 

layers of b. 0 d ing agent ,nd re.a.ned ,„ specified locations, no need to measure tbe stgnal from tbe en.u-e s^t bm 

e.g. hollo^ on a suppon pla.e. AHemalively tbe material scanning ool v a portion is sufficient. Each bind" E Cm t 

may be form of « sheet orplate wh IC his spotted with preferably lai>eUed with the same label bu, differfnUab^b 

an array of dots of binding agent. It can be advantageous for can be used. "'uercm larwis 

the configuration of the suppon means to be sucb that liouid « -n— k „a- . u , 

samples of approximatelv Se volume V lit« are Iwdflv «f ^ blDdin8 1/ a e eots ^ be ^ '° suppon in any 

reZed in contact with .'be plurality of Sp S apar o^- Idto >l?L nTi °' T'V^l ^ 

tions marked with tic different binding agents. For examp?e ^"f ino 8 "l 0D, ° & T° ^ ^ f ° f eX4mple by 

the spaced apart locations may be arranged in a weu K Z 2 h h a P a ". lo " l ' 0D ° n ™PP°" with a 

support means, aod a pluralit/of wells, each provided wSb 60 '? VT °- ' T'" df ° P> 

the same group of different binding aeeots in snaced anar. ™«°bier,on a 1 mm 5 spot, and allowing them to remain 

loc..ions,can be linked i«h??fS?SSS ph. " ^T' *V ^ ° f T, * bt dK>pS 

for use with a plurality of samples. P 3W3y ' * C °T° l fraC "° n ° f ,ht bindin 6 *6ent 

„-.... L , . preseDt in tbe drop becomes adsorbed onto tbe suppon as a 

When the support means is to be used in conjunction with resuJ. of this procedure. 1. is to be noted that tbe coa^rJ 
a measuring systtm moving light scanning, tbe material, 65 density of binding agent on the microspo. does not need to 

e.g. plastics, for Ae support ,s des,rably opaque to light, for be less than me coating density in con^iona^^ 

example it may be filled with an opaafymg material which coated tubes; tbe reduction in L number of molecules^, 
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density is generally desirable VmS£ tl2 such a^^S ? J - * «*»P™* £>™«« , 

ratios. Toe sizes of fee spots are advantaSvKa^S SS " ^ bm * COn " n0n bindin 8 

mm*, preferably less U»n 1 mn.^ The *pmtio n b ^117^ "^^ reagen,s ,bk 10 with 

desirably, but no, necessarily, 2 ct?tiJ,V2dKte 5 T * m8 ? ? m ni0re ,b " 0ne of *«»■ 

spot, or more. These suggested geometries can nevertheless t .■ , ^ ted u»que »e signals representative of the 

be changed as required, being subject solely to the KmhT I occupancy of the binding agent in the test samples 

dons on the numberof binding , gen, nxSs in eacK. £ S£T ?° n « om,llons ° f «* can be calibrated 

the minimum volume of mTsample to wS me w.Tof S L 0 « ° ^ CUIV « obtuned fin, sundard 

spots will be exposed and the me'ans IocSry.v5lSl fa 10 rrKeHS*™.^™*" ° f Ue Mffle 

— y preparing an array of ^ I £S£ tt^p^ 

Once the binding agents have been coated on. the 
support ,t is conventional practice to wash the support, in the 1S « *&*niflt antibody) or unoccupied fedfe shes7S 
case of annbod.es is binding agents, with a solution con- " w t th » an.i-idiorypic antibody), as one is thecoo^tf £ 
laming albumen or other protein to saturate all remaining ? tber ; For Boater accuracy it is desirable to measure the 
non-specific adsorption sites on the support and elsewhere ^ act, , 0D whico «» cl °*r «o zero because a change in frac- 
To confirm that the amount of binding agent in as individual ,,0Dal •ffP^cy of 0.01 is proportionately greater in this 
spot will be less than the maximum amount (0.1 V/K) , n $!^, , ,* lth ? u * b for Actional occupancies in the ranee 
required to conform to the principle of the present invention 5% * llber allernalive " generally satisfactory, 

the amount of binding agent present on any individual site ' ,n ,ba ' embodiment of the present invention which relies 
can be checked by labelling the binding agent with a °° *"? flu ° rescenl markers, toe measurement of relative 
delectable marker of known specific activity (i.e. known !° l !? S1,y ° f ,be signals from »* ,wo markers, one on the 
amount of marker per unit weight of binding agent) and „ ^ agent and the other on the site recognition reagent, 
measuring the amount of marker present. Thus, if the use of »H Vl"^- Tf? 8 IaS f r scannin g confocal microscope 
labelled binder is no, desired on the solid support used kibe , S*** Lase,shar P MR C 500. available from 

method of the invention the binding agent can nevertheless Hon',^^?- 0 "" Ud - Md baVm * a dual «*■»«• 
be labelled in a trial experiment and identical condition^ tet^^""™?" ° D " ^ be " m to scao 
those found in that trial to give rise to correct loading of ,h! \ J*' ° D , lbe ^P 0 " 10 cause florescence of 
binding agen. can be used ,o 'apply unlabSe^Tbinding Ten X ™ ,£T ^,^7^ fillels '° ****** »»d mea- 
to the supports to be actually used 8 8 f lbe am0un,s of florescence emitted. Tune-resolved 

The minimum size of the liquid sample (V li, e ^ u ^T*""* may ab ° * uscd - Interference (so- 

correlated with the number of mdteofSL.eS c fed crosstalk) between the two channels can be compen- 

Iban 0.1 V/K) so that only ^S^S^o^JZ S£ corrections if it occurs or conventual 

analyie present in me liquid s^El^ J u 35 f 0 " 5 CaD * made lo reduce " Domination of the rwo 
binding agent. This proportion K a Sri r5 S "than J^Ti ^""l en "" ed by ,be ^ V" « 
10%, usually less than 5% and Tde^ably Tor ^W fi^ P "V?- PreSem f0nD ° f mis i^trument, by 

depending on the accuracy desired for the ™JZ Z^l fl"l C ?f* ° f *e characteristic wave- 

accuracy being obtained, other "hinc bet/^Lt 8 '^ ? t ' W ° fluorcsoen ' emissions; however, fluorcs- 

'^Uv^^ly^M^^^,^' 40 T S y bS,aDCeS 66 ^^g-^ed by other physical 
of omer error-introducing Z£ tlHS ^"c'ensncs such as differing fluorescence decay limes, 
the order of one or a 8 Sm LfirS S^? 8 " meS, I"* a ° y ° f ,hese means ma y ^ «-«« 

microliters or less, are often preferred bm cfaSLuJS flu^ni °' !? ' 0,Dbulall0D ' 10 differentiate between rwo 
may arise when larger volumes « n«L C " CUmslaDO f s A*«>ropbores and hence permit measurement of the ratio of 

J^^tM^^j^szss^Si 45 ,ab ^ ,led en,i,ies (binding a8e «" and 8ite - 

sample may be used „ Us JLn^mS^'I^t ™* m '"f" 1 ) P" 56 "' 0D »» individual spot, using 

3^:— Err S£3Hs=SE= 

labelled, or the analyte cov.lenUy eSteS toTno hi? ^^V n8U " h - ,B « , lhC ^ d ' VidUal signals from one or f'°rn 
molecule-<.g. a protein moS-St , * P ™ 2 SUCb labels " e iUo we " >»>own. For 

indirectiv labelled The si^ ,™;.- ly ° r " ample ,wo "dioisotopes s«cb as " s l and »'I may be 

manner. Other labels such as cheoilumkescem « „ ? 8 ' COlOU,S • 0r lwo chemfluminescent 

radioisotopic labels may be U *£ ff tS F .T. . L ■ ' u'' 8 , ,° f . d ' fferen ' chemi,u minescent lifetime or 

ttt&TT S^^SyS ^same- „ i^^a"-^ ^ ^ » 

label but different labels can be used in different reaoemc tu , 

The site-recognition reagents may be specif TforTsinpt may USed f ° r ,he o{ aDal y'« 

8 may oe specific for a angle present in biological fluids, for example human body fluids 
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sucb as blood, scrum, saliva or urine. They may be used for 
the assaying of a wide variety of hormones, proteins, 
enzymes or other analytes which are either present naturally 
in the liquid sample or may be present artificially such as 
drugs, poisons or the like. 

For example, the invention may be used to provide a 
device for quantitatively assaying a variety of hormones 
relating to pregnancy and reproduction, such as FSH, LH, 
HCG, prolactin and steroid hormones (e.g. progesterone, 
estradiol, testosterone and androstene-dione), or hormones 
of the adrenal pituitary axis, sucb as Cortisol, ACTH and 
aldosterone, or thyroid- related hormones, sucb as T4, T3, 
and TSH and their binding protein TBG, or viruses such as 
hepatitis, AIDS or herpes virus, or bacteria, sucb as 
staphylococci, streptococci, pneumococci, gonococci and 
cnterococci, or tumour-related peptides sucb as AFP or 
CEA, or drugs such as those banned as illicit improvers of 
athletes' performance, or food contaminants. In each case 
the binding agents used will be specific for the analytes to be 
assayed (as compared with others in tbe sample) and may be 
monoclonal antibodies therefor. 

Further details on the methodology are to be found in my 
International Patent Publication WO88/01058, the contents 
of which are incorporated herein by reference. 

The invention is illustrated by the foUowing Examples. 

EXAMPLE 1 

An anli-TNF (tumour necrosis factor) antibody having an 
affinity constant for TNF at 25° C. of about lxlO 9 liters/mole 30 
is labelled with Texas Red. A solution of the antibody at a 
concentration of 80 micTograms/ml is formed and 0.5 micro- 
liter aliquots of this solution are added in the form of 
droplets one to each well of a Dynatecb Microfluor (opaque 
white) filled polystyrene microtilre plate having 12 wells. 

An anti-HCG (human chorionic gonadotropin) antibody 
having an affinity constant for HCG at 25° C. of about 6xl0 8 
liters/mole is also labelled with Texas Red. A solution of the 
antibody at a concentration of 80 micrograms/ml is formed 
and 0.5 microliter aliquots of this solution are added in tbe 
form of droplets one to each well of the same Dynatecb 
Microfluor microti tre plate. 

After addition of tbe droplets the plate is left for a few 
hours in a humid atmosphere to prevent evaporation of the 
droplets. During this time some of the antibody molecules in 
the droplets become adsorbed onto the plate. Next, the wells 
are washed several limes with a phosphate buffer and then 
they are filled with about 400 microliters of a \% albumen 
solution and left for several hours to saturate the residual 
binding sites in the we Us. Thereafter they are washed again 
with pbospbate buffer. 

The resulting plate has in each of its wells two spots each 
of area approximately 1 mm 2 . Measurement of tbe amount 
of fluorescence shows that in each well one spot contains 
about 5x10* molecules of anU-TNF antibody and tbe other 
contains about 5x10* molecules of anti-HCG antibody. The 
wells are designed for use with liquid samples of volume 
400 microliters, so that 0.1 V/K is 4xl0" J4 moles 
(equivalent to 2.4x10" molecules) for the anu-TNF anti- 
body and 7xl0" 14 moles (equivalent to 4xl0 JO molecules) 
for tbe anti-HCG antibody. 
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amounting to about 4O0 microliters, is added to one of the 
wells and allowed to incubate for several hours. About 400 
microliters of various standard solutions containing known 
concentrations (0.02, 0.2, 2 and 20 ng/ml) of TNF or HCG 
are added to other wells of the plate and also allowed to 
incubate for several bours. The wells are then washed 
several times with buffer solution. 

As site -recognition reagents there are used for the TNF 
spots an anti-TNF antibody having an a ffini ty constant for 
TNF at 25° C. of about lxl0 J0 liters/molc and for the HCG 
spots an anti-HCG antibody having an affinity constant for 
HCG at 25° C. of about 1x10" liters/mole. Both antibodies 
are labelled with fluorescein (FTTC). 400 microliter aliquots 
of solutions of these labelled antibodies are added to the 
wells and allowed to stand for a few hours. The wells arc 
then washed with buffer. 

Tbe resulting fluorescence ratio of eacb spot is quantified 
with a Bio-Rad Lasersharp MRC 500confocal microscope. 
From tbe standard solutions dose response curves for TNF 
and HCG are built up, the figures for TNF being as follows: 



TNF cooctntratioo 
ng/ml 



FTTC fluorescence _ _ 

i«« Kedliuoroceocx ooTNF «P ot 



0.02 

0.2 
2 
20 



1J 
4.6 

7.9 
42.5 



and those for HCG being as follows: 



HCG concentration 
ng/mJ 



FTTC fluoresce pee „„ 
Jerked fluoresce 00 *P° l 
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0.02 
0.2 
2 
20 



1.8 
7.2 
16.0 
2Z.2 
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EXAMPLE 2 

A microtilre plate prepared as described in Example 1 is 65 
used in an assay for an artificially produced solution con- 
taining TNF and HCG. A lest sample of the solution, 



The artificially produced solution was found to give ratio 
readings of 5.9 on tbe TNF spot and 105 on tbe HCG spot, 
correlating well with the actual concentrations of TNF (05 
ng/ml) and HCG (0.5 ng/ml) obtained from the dose 
response curves. 

EXAMPLE 3 

Using similar procedures to those outlined in Example 1 
a microtilre plate containing spots of labelled anti-T4 
(thyroxine) antibody (affinity constant about 1x10" liters/ 
mole at 25° C), labelled anti-TSH (thyroid stimulating 
hormone) antibody (affinity constant about 5x10 s liters/ 
mole at 25° C.) and labelled anti-T3 (triiodothyronine) 
antibody (affinity constant about 1x10" liters/mole at 25° 
C.) in each of the individual wells is produced, the spots 
containing less than 3 xlO - " V moles of anti-T4 antibody or 
less than 2xl0" n V moles of anti-TSH antibody or less than 
lxlO" 12 V moles of anti-T3 antibody. 

The developing antibody (site-recognition reagent) for the 
TSH assay is an anti-TSH antibody with an affinity constant 
for TSH of 2xl0 JO liters/mole at* 25° C. This antibody is 
labelled with fluorescein (FITC). The site-recognition 
reagents for tbe T4 and T3 assays are T4 and T3 coupled to 
poly-lysine and labelled with FITC, and tbey recognise the 
unfilled sites on their respective first antibodies. 

Using 400 microliter aliquots of standard solutions con- 
taining various known amounts of T4, T3 and TSH, dose 
response curves are obtained by methods analogous to those 
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m Example 2, correlating fluorescence ratios with T4 13 
and TSH coocemrations. The plaic is used to measure T4 73 
and TSH levels in senim from human patients with cood 
correlation with the results obtained by other methods 

EXAMPLE 4 
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Using similar procedures to those outlined in Example 1 
a microutre plate containing spots of first labelled anti-HCG 

antibody (affinity constant about 6x10° liters/mole at 25° • 

C), second labelled anti-HCG antibody (affinity constant » c ?' b - T ^ cI Cbromatograj 

about 13x10" liters/mole at 25° C.) and labelled ami FSH P°**ssium sulphate) 

(follicle stimulating homWi /.«.: 1. Gear 2 ml of ascites fluid by centrifiigaiion at 4000 

rpm. 



4. Add 3.6 ml saturated ammonium sulphate (40% 
saturation) under constant stirring.- Repeat Step 2. 

5. Discard the supernatant and dissolve the pellet in the 
desired buffer. 

6. Dialyse overnight in cold against the same buffer (using 
fresh, bofled-in-dAv dialysis bag). 

7. Determine the protein concentration either at V or by 
Lowry estimation. 

l.b. T-gel Chromatography: (Buffer: 1M Tris-Cl. pH 7 6 
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(follicle stimulating hormone) antibody (affinity constant 
about 13*10* liters/mole at 25' C) in each of tb4S 
wefls is produced the spots each containing less than 0.1 
V/K moles of the respective antibody. A cross-rcacting 
(alpha suburjn) monoclonal antibody 8D10 with an affinity 
constant of 1x10" hters/molc is used as a common devel'. 
oping antibody for both the HCG and the FSH assays 

Using 400 microliter aliquots of standard solutions con- 
laming various known concentrations of HCG and FSH 
dose response curves are obtained by methods analogous to 
U ^u Pk 2 ' °° rrelaliD g fluorescence ratios with 
HCG and FSH concentrations, the curve obtained with the 
higher affinity anti-HCG antibody giving more 
concentration-sensitive results at the lower HCG concentra- 
tions whereas the curve from the lower affinity anti-HCG 
antibody is more concemradoo-sensitive at the higher HCG 
concentrations. The plate is used to measure HCG and FSH 
concentrates in the urine of women in pregnancy testing 
giving good correlations with results obtained by other' 
means and achieving effective concentration measurements 
for HCG over a concentration range of two or three orders 
of magnitude by correct choice of the best HCG spot and 
dose response curve. 

Production of Labelled Antibodies 

The labelling of the antibodies with fluorescent labels can 
be earned out by a well known and standard technique see 
Leslie Hudson and Frank C. Hay, "Practical Immunology", 
Blackwell Scientific Publications (1980), pages 11-13, fo 40 
example as follows: ' ^ 

The monoclonal antibody anti-FSH 3G3, an FSH specific 
(beta subunit) antibody having an affinity constant (K) of 
13x10" liters per mole, was produced in the Middlesex 
Hospital Medical School, and was labelled with TRITC 

Swe^ice 150 ^^ 1141 ^ ° F TCXaS Rcd ' giviDg a rcd 
The monoclonal anubody anti-FSH 8D10 a cross- 
, r " a j g ^JPb^«bunit) antibody having an affini.y constant 

Middlesex Hospital Medical School and was labelled with 

fl^riSna^' 11 *° IhioCyanale) ' * ivin * a y«=»ow. g recn 

Tne general procedure used involved ascites fluid purifi- 
cation (ammonium sulphate precipitation and T-eel 
chromatography) foDowed by labelling, according to fne 
following steps: 6 

l.a. Ammonium sulphate purification 

1. Add 4.1 ml saturated ammonium sulphate solution to 5 
ml anubody preparation (culture supernatant or 15 
diluted ascites fluid) under constant stirrine (45% 
saturation). 6 x 

2. Continue stirring for 30-90 min. Centrifuge at 2500 
rpm for 30 min. 

3< Sc?,? ^ S " pcrnalant aDd disso lv »be precipitate in 
PBS (final volume 5 ml.). Repeat Steps 1 and 2, OR 
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30 



45 



50 



2 * A fn !w Tris " Q S0luti0 ° 10 achievc fio*l concentration 
ot 0.1M. 

3. Add sufficient amount of solid potassium sulphate. 
Final coDcentration-O^M. 

4. Apply the ascile fluid to the T-gel column. 

5. Wash the column with 0.1M Tris-Cl buffer containing 
0.5M potassium sulphate, until protein profile (at A,3 
returns to zero. * 1 ^*° } 

6. Elute the absorbed protein using 0.1 M Tris-Cl buffer as 
the eluant. 

7. Pool the fractions containing antibody activity and 
concentrate using Amicon 30 concentrater. 

8. If HPHT purification is to be carried out, use HPHT 
chromatography Starting buffer during Step 7 

2. Labelling of Antibodies FITC/TRITC conjugation 
1. Dialyse the purified 1 g protein into 0.25M Carbonate- 

bicarbonate buffer, pH 9.0 to a concentration of 20 
mg/ml. 

2 ' , Add n n TC/TOTC to achieve a 1:20 ratio with protein 
(i.e. 0.05 mg for every 1 mg of protein). 

3. Mix and incubate at 4° C. for 16-18 hrs. 

4. Separate the conjugated protein from unconjugated by 

a. Scpbadex G-25 chromatography for FTTC label, 

b. DEAE-Sepbacel chromatography for TRITC/FITC 
label. 

Buffer system: 
PBS for (a). 

0.005M Phosphate, pH 8.0 and 0.18M 
Phosphate, pH 8.0 for (b). 

Calculation of FITC: Protein coupling ratio: - 



or 



2.87 x CLP. 495 am 
O.D. 280 nm^ 
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- U x C>i) 49i nm 



EXAMPLE 4 

Regents 

1 TSH standards from the National Institute for Biological 
Standards and Control 

2 TSH-free Serum for making up TSH standards 

3 125 l-)abelled TSH 

4 Anti-TSH monoclonal antibodies from The Scottish 
Antibody Production Unit 

5 Phosphate buffer, 0.1M, pH 7.4 

6 Tris-HCl buffer, 0.05M, pH 7.6, containing 0.5% bovine 
senim albumin (BSA), 0.05% Tween 20 and 0.1% 
sodium azide 

7 Wash buffer: Phosphate buffer, 0.1 M. pH 7.4, containing 
0.1% Tween 20 and 0.1% sodium azide 
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8 Black microti ire strips from Dynatcch 

9 SuperBlock from Pierce 

A. Protocol and Conditions for the Radioimmunoassay of 
Thyroid Stimulating Hormone (TSH) 

1. An aliquot of 50 of 50 M g/m\ anti-TSH monoclonal 
antibody in phosphate buffer was added to microiitre 
wells and incubated for 1 hour at room temperature. 

2. Tbe microtitre wells were washed with phosphate 
buffer, blocked with SuperBlock for 30 minutes at 
room temperature and then washed again. 

3. An aliquot of 100 ^ of TSH standards made up in 
TSH-free serum (to yield final concentrations of 0 

1£\ 2X1 !K ' 4X1 °~ S ' 8xl0 " 9 ' 12x10 ' 9 . 16xl(T 9 and 
20x10 M/L) or unknown serum samples and 100 id of 
'^-labelled TSH in Tris-HQ assay buffer were added 
to triplicate anti-TSH monoclonil antibody coated 
microtitre wells, shaken for 1 hour at room 
temperature, washed with wash buffer and counted for 
radioactivity. The concentration of TSH in the 
unknown samples can be read from the standard curve 
The incubation period of 1 hour for the assay is far less 
than tbe time required for the binding reaction to go to 
equilibrium, but, provided tbe standards are measured under 
the same conditions, Lhe unknown sample can be measured 
against those standards. The effective affinity constant for 
the antibody will of course be that which penains after 1 
bco^incubation and under the same conditions as the assay 

B. Procedure for Obtaining tbe Affinity Constant K of the 
Anti-TSH Monoclonal Antibody Used in a Radioimmunoas- 
say Performed Under the Conditions Described in (A) 
1. An aliquot of 50 A] of SO^g/ml anti-TSH monoclonal 
antibody in phosphate buffer was added to microtitre 
wells and incubated for 1 hour at room temperature. 
2 Tbe microtitre wells were washed with phosphate 
buffer, blocked with SuperBlock for 30 minutes at 
room temperature and then washed again. 

3. An aliquot of 100 /d of TSH standards made up in 
TSH-free serum (to yield final concentrations of 0 
lxl0- p , 2xl0- 9 , 4xl0- 9 , 8x10"*, 12x10-*, 16xl0- p ' 
and 20xl0- ff M/L) and 100 /d of 125 Mabelled TSH in 
Tris-HCl assay buffer were added to triplicate antibodv 
coated microtitre wells, shaken for 1 hour at room 
temperature, washed with wash buffer and counted for 
radioactivity. 

4. A standard Scatcbard plot of Bound/Free vs. Bound 
TSH was used to obtain tbe affinity constant K for the 
monoclonal anti-TSH antibody. 

C. A TSH Assay Using an Amount of Capture Antibody 50 
*0.1 V/K and Deposited on the Solid-Phase as Microspots 

Since tbe assay volume V is 0.2 ml or 2x10- L and the 
affinity constant K of tbe anti-TSH capture antibody used 
under conditions described in (B) was found to be l.lxlO 8 
L/M, therefore tbe maximum amount of capture antibodv 55 
allowed in the assay under ambient analyte condition 
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14 

aspirated instantly. This procedure resulted in antibodv 
microspots with a coated area of approximately 1& 



Molar amount of coaxed antibody on miaospoc 

" 1°?** *!* * IDtibod ? *wityyAvogidiD Number - 

(JO - x ]0*y(6.01 » lO^M 
- 1.7 « 10' ,4 M 



or a capture antibody concentration of 0.85x1 0" 10 M/L. 

2. T*e microtitre wells were washed with phosphate 
buffer and tbe unreacted sites blocked with SuperBlock 
for 30 minutes at room temperature and then washed 
again with phosphate buffer. 

3. 100 ^ of TSH standards (made up in TSH-free serum) 
or unknown samples plus 100 & of Tris-HCl assay 
buffer were added to triplicate microtitre wells, shaken 
for 1 hour at room temperature and washed with wash 
buffer. 
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-0.1 V/K 

• (0.1 x 2 m 10->1.1 x 30»M 
- 3.8 x 10""M 



Or a capture antibody concentration of 9x1 0 10 M/L. 
Assay Protocol: 

1. A 0.5 fA droplet of a monoclonal anti-TSH capture 
antibody in phosphate buffer and at a concentration of 
200 /ig/inl was added to each microtitre well and 
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4. The TSH bound sites were back-titrated using fluores- 
cent labelled anti-TSH developing monoclonal anti- 
body raised against a different site on tbe TSH molecule 
and complementary to the capture antibody deposited 
as microdot on the solid-phase. An aliquot of 200 id of 
the developing antibody in Tris-HQ assay buffer was 
added to the microtitre wells, shaken for 1 hour at room 
temperature, washed with wash buffer, scanned with a 
BioRad laser scanning confocal microscope and the 
amount of fluorescence on the microspots and the 
amount of fluorescence on tbe microspots quantified 
Tbe concentration of TSH in the unknown samples 
were read from tbe standard curve. 
Although, for the purpose of illustration, tbe affinity 
constant of the antibody was measured under the assay 
conditions, id practice, in many cases it may not be neces- 
sary actually to perform such a measurement, so lone as it 
is obvious, having regard to the details of the assay in 
question, that the amount of capture antibody used on anv 
spot is going to be less than 0.1 V/K. 
What is claimed is: 

1. A method for determining tbe ambient concentration of 
an analyte of interest among a plurality of analytes in a 
liquid sample of volume V liters, comprising: 
loading a plurality of different binding agents, each being 
labelled with a marker and being capable of reversibly 
binding an analyte which is or may be present in the 
liquid sample and is specific for said analyte as com- 
pared to the other components of the liquid sample 
onto a support means at a plurality of spaced apart 
small spots sucb that not more than 0.1 V/K moles of 
binding agent are present on any spot, wbere K liters/ 
mole is the affinity constant of said binding agent for 
said analyte; 

contacting tbe loaded support means with the liquid 
sample to be analyzed, such that each of the spots is 
contacted in the same step with said liquid sample the 
amount of bquid used in said sample being such 'that 
only an insignificant proportion of any analyte present 
in said liquid sample becomes bound to said binding 
agent specific for said analyte; 

contacting tbe support with a site-recognition reagent 
specific for each binding agent in a competitive or 
noncompetitive technique, the site-recognition reagent 
being capable of recognizing either the unfilled binding 
sites or the filled binding sites on said binding agent, 
said site-recognition reagent being labelled with a 
marker different from tbe marker on said binding agent 
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measuring a ratio of signals from said markers on the sile 
recognition reagent and the binding reagent from at 
least a part of the spot, from which the analyte to 
interest is determined. 

2. A method according to claim 1, wherein the markers on 
the site-recognition reagent and the binding reagent are 
fluorescent markers. 

3. A method according to claim 2, wherein the ratio of 
signals is measured using a laser scanning confocaJ micro- 
scope. 

4. A method for determining the fractional binding site 
occupancy of a plurality of binding agents by a plurality of 
analytes in a liquid sample of V liters, comprising: 

(a) loading a plurality of different binding agents, each 
being capable of reversibly binding an anaivte which is 
or may be present in the liquid sample and is specific 
for said analyte as compared to the other components of 
the liquid sample, onto a support at a plurality of spaced 
apart small spots such that each spot has a high coating 
density of one of said binding agents but not more than 
0.1 V/K moles of binding agent are present on any one 
spot, where K liters/mole is the affinity constant of said 
binding agent for said analyte; 

(b) contacting the loaded support with the liquid sample 
to be analyzed, such that each of the spots is contacted 
in the same step with said liquid sample, the amount of 
liquid used m said sample being such that only an 
insignificant proportion of any analyte present in said 
liquid sample becomes bound to said binding agent 
specific for said analyte; and 

(c) thereafter contacting the loaded suppon with site- 
recognition reagents which recognize cither the unfilled 
binding sites or fiUed binding sites of that binding 
agent, the site-recognition reagents being labelled with 
markers from which the fractional binding site occu- 
pancy for each binding agent is determined 

5. The method of claim 4, wherein the site-recognition 
reagents are labelled with fluorescent markers. 

6. The method of claim 4, wherein the presence of the 
site-recognition reagents on each respective binding agent is 
determined consecutively. 

7. The method of claim 4, wherein the presence of the 
site-recognition reagents on each respective binding agent is 
determined simultaneously. 

8. The method of claim 4, further comprising, after step 
(c), calculating the concentration level of each reagent using 
the determined value of the fractional binding site occu- 
pancy. 

9. A method for detecting a plurality of analvtes in a liquid 
sample of volume V liters, comprising: 
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loading a plurality of different binding agents, each being 
capable of reversibly binding an analyte which iTor 
may be present in the liquid sample and is specific for 
said analyte as compared to the other components of 
the liquid sample, onto a suppon means at a plurality of 
spaced apart small spots such that each spot has a high 
coating density of one of said binding agents but not 
more than 0.1 V/K moles of binding agent arc present 
on any spot, where K liters/moles is the aiEnity con- 
stant of said binding agent for said analyte; 
contacting the loaded suppon means with the liquid 
sample to be analyzed, such that each of the spots is 
contacted in the same step with said liquid sample, the 
amount of liquid used in said sample being such that 
only an insignificant proportion of any analyte present 
in said liquid sample becomes bound to said binding 
agent specific for said analyte; 
contacting the suppon with a site-recognition reagent 
specific for each binding agent in a competitive or 
non-competitivc technique, the site-recognition reagent 
being capable of recognizing either the unfilled binding 
sites or the filled binding sites on said binding agent, 
said site-recognition reagent being labelled with a 
marker; and 

measuring the signal from the marker of the site- 
recognition reagent in a panicular location to delect the 
presence of said plurality of analytes in said sample 

10. A method as claimed in claim 9, wherein each of said 
spots has a size of less than 1 mm 2 . 

11. A method as claimed in claim 10, wherein each of said 
spots contains more than 10 4 molecules of binding agent. 

12. A method as claimed in claim 11, wherein each of said 
spots has less than 0.01 V/K moles of binding agent. 

13. A method as claimed in claim 11, wherein said binding 
agents used have affinity constants for said analytes of from 
1(T to 10" liters per mole. 

14. A method as claimed in claim 11, wherein said binding 
agents used have affinity constants for said analytes of the 
order of lO^to 10 J1 liters per mole. 

15. A method as claimed in claim U, wherein the volume 
of said liquid sample is not more than 0.1 liter. 

16 A method as claimed in claim U, wherein the volume 
of said liquid sample is 400 to 1000 microliters. 

17. A method as claimed in claim 9, wherein said binding 
agents loaded onto said suppon means are antibodies for the 
analytes whose concentrations are to be determined 
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ABSTRACT 



The present invention provides methods for determining the 
^*; f » «q«id samples in w£b £ 

amount of binding agent having binding sites specific for a 

zone on a solid support, tbe binding agent being divided into 
an array of spatially separated locations in the test zone The 
concentration of the analyte is obtained by back-litSh e 
occupy bmding agent with a developing agent hS a 

Z^'Z tbeSigDal fro » iLationTL 

ri^L™ P !" U,Vem,0D 8180 P"«*~ * method for 
sttTh?K a ; alUe represent " ive of ■ faction of bulding 
sues of the binding agent which are occupied by tbe analvte 
comprising immobilizing the specific binding on.' 
solid support, wherein tbe specific binding .gem Uie 

V/K moles, where V « the volume of the liquid sample .nd 
£n£ ' be , aSS ? c,a u Uon C0DS,anI for the .nalyte specificaUy 
binding to the binding agent, and where* tbTspedfic 
binding agent is divided into an array of spatially «S 
locations; contacting the support wW. the liquid ^ 
contacting , be support with tbe developing agent- sewX 
non-specifically bound developing *AAJSS££ 
signal at each of the locations to obtain a value which 
represents the fraction of tbe binding site? occuS btS 
analyse a. each location; and adding the meas^d values * 
provide a total signal which indices the fraaion of ,h1 

SSI'S d kd ^- a8enl occupied 

21 Claims, 3 Drawing Sheets 
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Equivalent microspot 
microspot radius = r = r /n 
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BINDING ASSAY 

This application is the U.S. national stage of PCT/GB94/ 
02814, filed Dec. 23, 1994. 

FIELD OF THE INVENTION 

The present invention relates to binding assays, e.g. for 
determining the concentration of analytes in liquid samples. 

BACKGROUND TO THE INVENTION 

It is known to measure the concentration of an analyte, 
sucb as a drug or hormone, in a liquid sample by contacting 
the liquid with a binding agent having binding sites specific 
for the analyte, separating the binding agent having analyte 
bound to it and measuring a value representative of the 
proportion of the binding sites on the binding agent that axe 
occupied by analyte (referred to as the fractional 
occupancy). Typically, the concentration of the analyte in the 
liquid sample can then be determined bv comparing the 
fractional occupancy against values obtained from a series 
of standard solutions containing known concentrations of 
analyte. 

In the past, the measurement of fractional occupancy has 
usually been carried out by back-titration with a labelled 
developing reagent using either so-called competitive or 
non-competitive methods. 

In the competitive method, the binding agent having 
analyte bound to it is back-titrated, either simultaneously or 
sequentially, with a labelled developing agent, which is 
typically a labelled version of the analyte. The developing 
agent can be said to compete for the binding sites on the 
binding agent with the analyte whose concentration is being 
measured. The fraction of the binding sites which become 
occupied with the labelled analyte can then be related to the 
concentration of the analyte in the liquid sample as 
described above. 

In the non-competitive method, the binding agent having 
analyte bound to it is back-tiirated with a labelled develop- 
ing agent capable of binding to either the bound analyte or 
the occupied binding sites on the binding agent. The frac- 
tional occupancy of the binding sites can then be measured 
by detecting the presence of the labelled developing agent 
and, just as with competitive assays, related to the concen- 
tration of the analyte in the liquid sample as described 
above. 

In both competitive and non-competitive methods, the 
developing agent is labelled with a marker. A variety of 
markers have been used in the past, for example radioactive 
isotopes, enzymes, chcmiluminescent markers and fluores- 
cent markers. 

In the field of immunoassay, competitive immunoassays 
have in general been carried out in accordance with design 
principles enunciated by Benson and Yalow, for instance in 
"Methods in Investigative and Diagnostic Endocrinology" 
(1973), pages 111 to 116. Berson and Yalow proposed that 
in the performance of competitive immunoassays, maximum 
sensitivity is achieved if an amount of binding agent is used 
to bind approximately 30 to 50% of a low concentration of 
the analyte to be detected. In non-competitive 
immunoassays, maximum sensitivity is generally thought to 
be achieved by using suflBcient binding agent to bind close 
to 100% of the analyte in the liquid sample. However, in 
both cases immunoassays designed in accordance with these 
widely accepted precepts require the volume of the sample 
to be known and the amount of binding agent used to be 
accurately known or known to be constant. 
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In International Patent Application WO84/01031, I dis- 
closed that the concentration of an analyte in a liquid simple 
can be measured by contacting the liquid sample with a 
small amount of binding agent having binding sites specific 
for the analyte. In this method, provided the amount of 
binding agent is small enough to have only an insignificant 
effect on the concentration of the analyte in the liquid 
sample, it is found that the fractional occupancy of the 
binding sites on the binding agent by the analyte is effec- 
tively independent of the volume of the sample. 

This approach is further refined in EP304,202 which 
discloses that the sensitivity and ease of development of the 
assays in WO84/01031 is improved by using an amount of 
binding agent less than 0.1 V/K moles located on a small area 
(or "microspol") of a solid support, where V is the volume 
of the sample and K is the equilibrium constant of the 
binding agent for the analyte. 

Id WO93/08472, 1 disclosed a method of further improv- 
ing the sensitivity of binding assays by immobilising small 
amounts of binding agent at high density on a support in the 
form of a microspol. In this assay, a developing agent 
comprising a microsphere containing a marker, e.g. a fluo- 
rescent dye, is used to back-titrate the binding agent after it 
has been contacted with the liquid sample containing the 
analyte. As the microsphere can contain a large number of 
molecules of fluorescent dye, the sensitivity of the assay is 
improved as the signal from small amounts of analyte can be 
amplified. This amplification permits sensitive assays to be 
carried out even with microspots having an area of 1 mm 2 
or less and a surface density of binding agent in the range of 
1000 to 100000 molecules/yum 2 . 

SUMMARY OF THE INVENTION 
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The present invention provides a method, device and test 
kit for carrying out a binding assay in which binding agent 
having binding sites specific for a given analyte in a liquid 
sample is immobilised in a test zone on a solid support, the 
binding agent being divided into an array of spatially 
40 separated locations in the test zone, wherein the concentra- 
tion of the analyte is obtained by integrating the signal from 
the locations in the array. 

Accordingly, in one aspect, the present invention provides 
a method for determining the concentration of an analyte in 
a liquid sample comprising: 

(a) locating binding agent having binding sites specific for 
the analyte in a lest zone on a solid support, the binding 
agent being divided into an array of spatially separated 
locations; 

(b) contacting the support with the liquid sample so that 
a fraction of the binding sites at each location become 
occupied by analyte; 

(c) measuring a value of a signal representative of the 
fraction of the binding sites occupied by the analyte for 
each individual location in the array; 

(d) integrating the signal value obtained for each location 
in the array to provide an integrated signal; and, 

(e) comparing the integrated signal to corresponding 
values, obtained from a series of standard solutions 
containing known concentrations of analyte, to deter- 
mine the concentration of the analyte in the liquid 
sample. 

Thus, in the present invention, the values of the signal 
from an array of locations in the test zone are used to 
determine the concentration of a single analyte. This is in 
contrast to the approach described in EP304,202, in which 
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the signal produced at a single location is used to determine 
the concentration of an analyte. 
The array of locations of binding agent in tbe test zone can 

s ^ u ' D,i t Uy ' C g - "** 8 CODl0C3i microscope, 
and tbe signal value from each location integrated to provide 
the integrated signal. Alternatively, tbe array of locations of 
binding agent id the test zone can be viewed together e e 
using a charge coupled device (CCD) camera, with' the 
signal values from each location being measured simulta- 
neously. 

Preferably, the signals representative of tbe fraction of the 
binding sues occupied by binding agent at each location are 
measured by back-titrating the binding agent with a devel- 
oping agent having a marker, the developing agent being 
capable of binding to unoccupied binding sites, bound 
analyte or to occupied binding sites in a competitive or 
non-competitive method, as described above. 

The marker on the developing agent can be a radioactive 
isotope, an enzyme, a cbemiluminescent marker or a fluo- 
rescent marker. The use of fluorescent dye markers is 
especially preferred as tbe fluorescent dyes can be selected 
to provide fluorescence of an appropriate colour range 
(excitation and emission wavelength) for detection Fluo- 
rescent dyes include coumarin, fluorescein, rbodamine and 
Texas Red. Fluorescent dye molecules having prolonged 
fluorescent periods can be used, thereby allowing time- 
resolved fluorescence to be used to measure tbe strength of 
the fluorescent signal after background fluorescence has 
decayed. Advantageously, marker can be incorporated 
within or on the surface of latex microspheres attached* the 
developing agent. This allows a large quantity of marker to 
be assocmed with each molecule of developing agent 
amplifying the signal produced by tbe developing agem ' 
Preferably, the locations are microdots and the assay is 
earned out using 4-tO (or more) microspots for each indi- 
vidual analyte, each microspot having an area less than 
10000 Aim , tbe m.orospote being separated from each other 
by a distance of 100-1000 m . The locations within the 
array are referred to as "mini-microspots" in tbe relevant 
parts of the description that follow. 

The present invention also allows tbe concentration of a 
plurality of analytes to be determined simultaneously bv 
providing a plurality of test zones, each test zone having 
immobilised in it a iota] amount of binding agent having 
binding sites specific for a given analyte in a liquid sample, 
the binding agent being divided into an array of spatiallv 
separated locations in the test zone. 

Preferably, in accordance with EP304.202 the total 
amount of binding agent in each array that is specific for a 
given analyte is less than 0.1 V/K moles, wbere V is the 
volume of the sample applied to the test zone and K is the 
association constant for analyte binding to tbe binding agent 
£,™ UrCS """ ,bC " ambie «" """lye" conditions described 
m WO83/01031 are fulfilled regardless of the anaiyTcon 
centra uon. 

One way of immobilising binding agent on a support at a 
discrete location such as a microspot is to use technoloev 
comparable to tbe techniques used in ink-jet or laser printers 
in the case of microspots typically providing spots having 
diameter of about 80 m . Alternatively, if larger locations 
are required, a micropipette can be used to control the 
amount of binding agent immobilised at a location on a 
support. 

The present invention is based on tbe observation that as 
the area of a microspot is reduced from a high value, such 
as 5 mm 3 towards zero, tbe sensitivity of tbe binding assav 
(represented by the lower limit of detection) reaches a 
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maximum when the microspot reaches . small, but finite 
«ea, typically around 0.1 mm*. Further reducing the areT f 
microspot leads to a reduction in the sensitivity of tbe 
bmdmg assay. However, sensitivity is enhanced by subdi- 
viding a microspot of any given area into multiple mini- 
microspots, such that tbe total coaled area occupied by tbe 
mini-microspots, and hence the total amount of binding 
agent remains tbe same. 

In addition, the use of an array of microspot for each 
individual analyte allows the user to determine whether the 
value obtained for any given microspot is in error by 
comparison with other microspots in the array 

In a further aspect, tbe present invention provides a device 
for determining tbe concentration of one or more analytes in 
a liquid sample, the device comprising a solid support 
having one or more test zones, each test zone having 
immobilised in it an amount of binding agent having binding 
sues specific for a given analyte in a liquid sample, the 
binding agent being divided into an array of spatially 
separated locations in the test zone, wherein the concentra- 
tion of a given analyte is obtained by integrating signal 
values from each location in the array. 

In a further aspect, the present invention provides a kit for 
determining the concentration of one or more analytes in a 
liquid sample, the kit comprising: 

(a) a device comprising a solid support having one or 
more test zones, each test zone having immobilised in 
it an amount of binding agent having binding sites 
specific for a given analyte in a liquid sample, tbe 
binding agent being divided into an array of spatiaDy 
separated locations in tbe lest zone; and, 

(b) one or more developing agents for determining tbe 
fraction of the binding siles of tbe binding agent 
occupied by a given analyte, the developing agents 
having markers, the developing agents being capable of 
binding to bound analyte, or unoccupied or occupied 
binding sites of tbe binding agent; 

wherein tbe concentration of a given analyte is obtained by 
integraung signal values from the markers of the developing 
agent at each location in tbe array. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
The unexpected observation of a microspot area yielding 
a maximum sensitivity is thought to arise because a number" 
of opposing effects combine to produce this outcome. These 
effects are explained with reference to tbe accompanying 
figures in which: r "J'"n 

FIG- 1 represents bow tbe sensitivity of a binding assay 
typically changes wub area of microspot at constant binding 
agent density; 6 

FIG. 2 represents tbe typical variation in signal-to-noise 
ratio as the area of a microspot changes; 

FIG. 3 represents how diffusion constraints on analyte 
binding to the binding agent change as the area of the 
microspot changes; 

FIG. 4 shows bow the error in signal measurement 
changes as the area of the microspot changes; 
FIG. 5 shows a comparison between the microspot array 
60 of tbe present .nvention and a single microspot of the prior 

an; and r 

FIG. 6 shows binding agent immobilised as an array of 
lines in an alicraative embodiment of the invention. 

DETAILED DESCRIPTION 
Note thai i in FIGS. 1 to 4, tbe exact shape of tbe curves 
shown will depend on a number of parameters, including the 
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physico-chemical properties (ie association and dissociation 
rate constants) of the binding agent, the viscosity of the 
analyte containing solution to which the microspot is 
exposed, the specific activity of the label used, etc. 

In all the figures, value A denotes the area of a microspot 
typically used in the prior an (typically 1 mm 2 ). In all the 
figures, the density of binding agent is kept constant. 

FIG. 1 shows the experimentally observed variation of 
sensitivity as the area of a microspot is reduced In the 
present context, sensitivity can be defined as the lower limit 
of detection which is given by the error (s.d) with which it 
is possible to measure zero signal. As FIG. 1 shows as the 
area is reduced from value A, the sensitivity of the binding 
assay reaches a maximum and then declines as the area of 
the microspot is further reduced towards zero. 

Some of the opposing factors leading to this observation 
are depicted in FIGS. 2 to 4. 

FIG. 2 shows how the signal-to-noise ratio associated 
with the measurement of the occupancy of the binding sites 
of the binding agent changes as the size of the microspot 
decreases towards zero, assuming equilibrium has been 
reached. As microspot area is reduced from value A, the 
fractional occupancy of the binding sites of the bindine 
agent reaches a plateau value as the concentration of binding 
agent falls below 0.01/K. Therefore, the signal per unit area 
from markers on developing agent used to measure the 
occupancy of the binding sites by analvte will also reach a 
plateau. As the background noise per unit area remains 
approximately constant, so the signal -to-noise ratio wiU 
likewise increase to a plateau value as the concentration of 
binding agent falls below 0.01/K. 

FIG. 3 shows how diffusion constraints change as the area 
of a microspot is reduced. "Diffusion constraints" restrict the 
rale at which analyte migrates towards and binds to the 
binding agent. As FIG. 3 snows, the diffusion constraints 
decrease as microspot size decreases, ie the kinetics of the 
binding process are faster for smaller microspols, implying 
that thermodynamic equilibrium in the system is reached 
more rapidly. 

On a molecular level, this phenomenon can be pictured as 
follows. When a microspot containing binding agent is 
placed in a liquid sample containing analyte, the binding 
agent binds anaJyte, depleting the local concentration of the 
analyte as compared to the liquid sample as a whole This 
leads to a concentration gradient being established in the 
vicinity of the microspot until thermodynamic equilibrium is 
reached. This process is found to be slower for larger 
microspots the difiusion constraint being approximated 
proportional to microspot radius. When the occupancy of the 
binding sites on the binding agent has reached an equilib- 
rium value, the concentration of analvte in tbe liquid sample 
is uniform. However, equilibrium is reached more rapidlv in 
the case of microspots of smaller size, implying that for any 
incubauon time less than that required to reach equilibrium 
in the case of tbe larger spot, the fractional occupancy of the 
binding sites on the smaller spot is greater. 

However, as microspot area decreases, so the amount of 
binding agent and tbe level of signal from developing agent 
will likewise decrease. This leads to an increase in the 
statistical errors in the measurement of the signal from a 
marker on a developing agent, which tend to infinity as the 
microspot area tends to zero (see FIG. 4). 

It can be seen that a consideration of the signal-to-noise 
ratio and diffusion constraints indicate an increase in the 
sensitivity of a binding assay as the area of a microspot is 
decreased. However, these factors arc opposed by an 
increase in tbe statistical error of signal measurement as the 
microspot area decreases. These factors combine to produce 
the observed variation of sensitivity with microspot area 
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shown in FIG. 1 Thus, the overall consequence is thai, as 
microspot area falls to zero, the binding assay becomes 

totally insensitive. 

However, it is desirable to develop sensitive miniaturised 
binding > assays using microspots of the smallest possible size 
containing yanishingly small amounts of binding agent, that 
nave rapid kinetics to minimise the time taken to carry out 

the assay. 3 

The present invention improves sensitivity and reduces 
binding assay incubation times by exploiting the contradic- 
tory effects discussed above to maximal advantage. TTus is 
done by sub-dividing the total amount of binding agent into 
an array of spatially separated locations such as "mini- 
microspots", to reduce diffusion constraints, and integrating 
tbe signals representative of the fractional occupancy of 
binding agent at each location to obtain a total signal greater 
than would have been achieved by using a singlf microspot 
equal in area to the total area occupied by the minimi- 
crospots comprising the minimicrospol array. 

This implies, inter alia, that tbe total amount of binding 
agent used can be made even smaller than in the prior an 
where a balance between kinetics and signal-to-noise rela- 
tive to statisucaJ errors had to be made to optimise sensi- 
tivity Tbe present invention therefore can improve the 
signal-to-noise ratio associated with measuring the analyte 
bound to binding agent, whilst reducing the diffusion con- 
straints associated with each microspot in tbe array 
Moreover, the increasing statistical errors observed in the 
pnor an as microspot size is reduced are obviated, as the 
signal generated from the occupied binding sites by analyte 
m the individual microspots is integrated over tbe array to 
provide an integrated signal, thereby retaining the signal 
measurement advantage observed for larger microspols 

FIG. 5 illustrates how a single microspot of the prior art 
can be div,ded inio an array of 25 microspols containing an 
equivalem total amount of binding agent. 

Nevertheless, other arrangements or geometries of bind- 
ing agent providing assays yielding the same benefits can be 
envisaged, see for instance FIG. 6 which shows binding 
agent immobilised as lines forming a grid (see the shaded 
areas). Tins configuration likewise has the effect of reducing 
the diffusion constraints whilst maintaining the total area 
coated with binding agent (e.g. an antibody) to obviate the 
increasing statistical errors and associated loss of sensitivity 
observed as tbe amount of binding agent is reduced. 

Tbe amount and distribution of the binding agent in tbe 
locations comprising tbe array depends on a variety of 
factors including the diffusion characteristics of the analyte 
the narure and viscosity of tbe liquid sample containing the 
analyte and the protocol used during incubation. However 
given the guidance here the skilled person can readily 
determine, either experimental or by computer modelling 
the optimal arrangement or geometry of array for any given 
binding assay. 3 & 

EXAMPLE 

Conjugation of Anti-TSH (Anti-Thyroid 
Stimulating Hormone) Mouse Monoclonal 
Antibody to Fluorescent Hydropbilic Latex 
Microspheres 

1. 10 mg of fluorescent bydrophilic latex microspheres in 
0.5 ml double distilled water were added to 0.5 ml of 1% 
TWEEN 20, surface-active agent, shaken for 15 min at room 
temperature and centrifuged at 8° C. for 10 min at 20 000 
rpm in a MSE High-Spin 20 ul trace mrifuge. 

2. The pellet was dispersed in 2 ml of 0.05M MES 
(2-[N-Morpholino] cthanesulfonic acid) buffer, pH6.1 and 
centrifuged. 



30 



35 



5,837,551 



3. Step 2 was repeated. 

4. The pellet was dispersed io 0.8 ml MES buffer. 

, J* 2 , mg of a f ) ! i *J SH monoclonal developing antibody in 
100 /d were added to the microspheres and shaken for 15 
nun at room temperature. 

LI™? 1 u f J °- 25 t ? ^ yI - 3 ("«M*yI amino) propyl 
carbodimidc hydrochloride were added to the mixture and 
shaken for 2 hours at room temperature. 

7. 10 mg glycine in 100 /d of MES buffer were added to 
the mixture, shaken for a further 30 min and ccntrifuged 

8. 71k : pellet was dispersed in 2 ml of 1% BSA (Bovine 
Serum Albumin), shaken for 1 hour at room temperature and 
ccntrifuged. 

9. The pellet was dispersed in 2 ml of 1 % BSA, shaken for 
1 hour at room temperature and ccntrifuged. 

10. Hie pellet was dispersed in 2 ml of 0.1M phosphate 
buffer, pH7.4 and ccntrifuged. 

11. Step 10 was repealed twice. 

12. The pellet was dispersed in 2 ml of 1% BSA contain- 
ing 0.1 sodium azide and stored at 4° C. 

Comparison of Kinetics of Micro Versus Mini- 
micro Capture Antibody Microspots in a Sandwich 
TSH (Thyroid Stimulating Hormone) Assay 
1. Anti-TSH capture antibody microspots (diameter 1 1 
mm, area-10 6 /m 2 ) were made by depositing 0.5 of 200 
jug/ml antibody solution on each of 16 Dvnatech black 
MicroFluor Microtitrc wells, the droplets were aspirated 
immediately, the wells blocked with SuperBJock from 
Pierce for 30 mm at room temperature and washed with 
0.1M phosphate buffer, pH7.4. 

2. The mini-microspots (diameter 0.16 mm) were made 
using an piezoelectric ink-jet print-bead with an anti-body 
solution concentration of 1 mg/ml and droplets of approxi- 
mately 100 pi picoliter for an array of 49 (7x7) mini- 
microspots per microtiire wells (total coated antibody area- 
10 m ) for 16 wells. The wells were blocked with 
SuperBlock and washed with phosphate buffer as above The 
coated antibody density for both micro and mini-microspots 
are estimated to be 2x10* IgG//an 2 . 

3. 200 /il of plasma containing 1 ^U/ml of TSH was added 
to all the microtitrc wells and shaken at room temperature 
At 30, 60, 120 mm and 18 hours (overnight), four wells 
containing the microspots and mini-microspots were washed 
with phosphate buffer containing 0.1% TWEEN 20 then 
incubated with 200 M l of anti-TSH developing antibody 
conjugated to hydrophilic latex microspheres in Tris-HCI 
buffer (50 Mg/ml) for 30 min at room temperature and 
washed with phosphate-TWEEN 20 buffer. The wells were 
then scanned with a laser scanning confocal microscope 
equiped with an Argon/Krypton laser. 

Results 
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Sample 

incubation 


Totil Fluorescent Signal 
(arbitrary units^ 


55 


times (mins) 


Microspot 


Mint- microspot 




30 
60 
120 

Overnight 


85 * 7 
338 * 15 
341 * 21 
185 x 20 


333 * 13 
349 * 16 
378 x 16 
391 x 23 


60 



Conclusion 

Significantly higher mean responses were observed 
between 30 and 120 mins in the mini-micro spot samples, 
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while toe overnight controls did not show significant differ- 
ences. This demonstrates that the mini-microspots have 
faster kinetic for the association of analyte with the capture 
antibody, and could be used to reduce incubation times. 
The invention claimed is: 

1. A method for determining the concentration of at least 
one analyte in a liquid sample, said method comprising for 
each analyte, the steps of: 

(a) immobilizing a specific binding agent including bind- 
ing sites specific for the analyte on a solid support 
wherein the specific binding agent used to determine' 
the concentration of the analyte is present in an amount 
less than 0.1 V/K moles, where V is the volume of the 
liquid sample and K is the association constant for the 
analyte specifically binding to the specific binding 
agent, and wherein said specific binding agent is 
divided into an array of spatially separated locations; 

(b) contacting the support with the sample so that a 
fraction of the binding sites of the specific binding 
agent specific for the analyte specifically binds the 
analyte; 

(c) contacting the support with a developing agent 
labelled with a signal-producing marker such that the 
labelled developing agent binds to unoccupied binding 
sites, to specifically bound analyte or to the binding 
sites with specifically bound analyie; 

(d) separating non-specifically bound developing agent 
from the solid support and measuring the signal pro- 
duced by the marker at each of the locations in the array 
to obtain a value which represents the fraction of the 
binding sites occupied by the analyte at each location; 

(e) adding the values obtained at the locations in the array 
to provide a total signal; and 

(f) comparing the total signal to corresponding values 
obtained from a series of standard solutions containing 
known concentrations of the analyte, to determine the 
concentration of the analyte in the liquid sample. 

2. The method according to claim 1, wherein the specific 
binding agent is divided into between 4 and 40 locations 

3. The method according to claim 1, wherein the locations 
have an area of about 30000 ,/m 2 , the locations being 
separated from each other by a distance of 100 to 1000 ^m. 

4. Toe method according to claim 1, wherein the concen- 
tration of a plurality of different analytes in the liquid sample 
are determined using a plurality of arrays on said support. 

5. The method according to claim 1, wherein (i) the 
specific binding agent is an antibody and the analvte is an 
antigen or (n) the specific binding agent is an oligonucle- 
otide and the analyte is a nucleic acid. 

6. A method for determining the concentration of at least 
one analyte in a liquid sample, said method comprising for 
each analyie, the steps of: 

(a) immobilizing a specific binding agent including bind- 
ing sites specific for the analyte on a solid support 
wherein the specific binding agent used to determine 
the concentration of the analyie is present in an amount 
less than 0.1 V/K moles, where V is the volume of the 
liquid sample and K is the association constant for the 
analyte specifically binding to the specific binding 
agent, and wherein said specific binding agent is 
divided into.an array of spatially separated locations; 

(b) contacting the support with the liquid sample so that 
a fraction of the binding sites of the binding agent 
specific for the analyte specifically bind the analyte; 

(c) contacting the support with a developing agent 
labelled with a signal-producing marker such that the 
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labelled developing agent binds to unoccupied binding 
sites, to specifically bound analyte or to tbe binding 
sites with specifically bound analyte; 

(d) separating non-specifically bound developing agent 
from tbe sob'd support and measuring tbe signal pro- 
duced by tbe marker at each of tbe locations in the array 
to obtain a value wbicb represents tbe fraction of tbe 
binding sites occupied by tbe analyte at eacb location; 
and 

(e) adding tbe values obtained at tbe locations in tbe array 
to provide a total signal wbicb indicates the concen- 
tration of tbe analyte in tbe liquid sample. 

7. The method according to claim 6 wberein the specific 
binding agent is divided into between 4 and 40 locations. 

S. The method according to claim 6, wherein the locations 
are in an area of about 10000 fan 7 , the locations being 
separated from each other by a distance of 100 to 1000 /an. 

9. The method according to claim 6, wherein the concen- 
trations of a plurality of different analytes in the b'quid 
sample are determined using a plurality of arrays on said : 
support. 

10. The method according to claim 6, wberein (i) the 
specific binding agent is ao antibody and the analyte is an 
antigen or (ii) the specific binding agent is an oligonucle- 
otide and tbe analyte is a nucleic acid. 

11. A method for determining tbe concentration of at least 
one analyte in a liquid sample, said method employing a 
solid support on which is immobilized, for eacb analyte, a 
specific binding agent including binding sites specific for the 
analyte, wherein the specific binding agent used to deter- 
mine the concentration of tbe analyte is present in an amount 
less tban 0.1 V/K moles, where V is the volume of the liquid 
sample and K is tbe association constant for tbe analyte 
specifically binding to tbe specific binding agent, and 
wherein said specific binding agent is divided into an array 
of spatially separated locations, said method comprisine the 
steps of: 

(a) contacting the support with the sample so that a 
fraction of the binding sites of the specific binding 
agent specific for the analyte specifically binds the 
analyte; 

(b) contacting the support with a developing agent 
labelled with a signal-producing marker such that the 
labelled developing agent binds to unoccupied binding 
sites, to specifically bound analyte or to tbe binding 
sites with specifically bound analyte; 

(c) separating non-specifically bound developing agent 
from the solid support and measuring tbe signal pro- 
duced by tbe marker at eacb of the locations in the array 
to obtain a value which represents the fraction of the 
binding sites occupied by tbe analyte at eacb location; 

(d) adding tbe values obtained at tbe locations in tbe array 
to provide a total signal; and 

(e) comparing the total signal to corresponding values 
obtained from a series of standard solutions containing 
known concentrations of (he analyte, to determine the 
concentration of the analyte in the liquid sample. 

12. The method according to claim 11, wherein tbe 
specific binding agent is divided into between 4 and 40 
locations. 
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13. The method according to claim 11, wberein tbe 
locations have an area of about 10000 /an 2 , tbe locations 
being separated from each other by a distance f 100 to 1000 

fJOXX. 

14. Tbe method according to claim 11, wberein tbe 
concentrations of a plurality of different analytes in tbe 
liquid sample arc determined using a plurabry of arrays on 
said support 

15. The method according to claim U, wberein tbe 
specific binding agent is an antibody and tbe analvte is an 
antigen. 

16. Tbe method according to claim 11, wherein the 
specific binding agent is an oligonucleotide and tbe analyte 
is a nucleic acid. 

17. A method for determining a value representative of a 
fraction of binding sites of a specific binding agent including 
binding sites specific for an analyte which binding sites are 
occupied by the analyte present in a liquid sample, said 
method comprising the steps of: 

(a) immobilizing tbe specific binding agent on a solid 
support, wberein the specific binding agent used for the 
fractional occupancy determination is present in an 
amount less than 0.1 V/K moles, where V is the volume 
of the liquid sample and K is the association constant 
for the analyte specifically binding to tbe specific 
binding agent, and wherein said specific binding agent 
is divided into an array of spatially separated locations; 

(b) contacting the support with tbe liquid sample so that 
a fraction of tbe binding sites of tbe binding agent 
specific for tbe analyte specifically bind the analyte; 

(c) contacting tbe support with a developing agent 
labelled with a signal-producing marker such that the 
labelled developing agent binds to unoccupied binding 
sites, to specifically bound analyte or to the binding 
sites with specifically bound analyte; 

(d) separating non-specifically bound developing agent 
from the solid support and measuring the signal pro- 
duced by tbe marker at each of tbe locations in the array 
to obtain ^a value which represents the fraction of tbe 
binding sites occupied by the analyte at each location; 
and 

(e) adding the values obtained at the locations in the array 
to provide a total signal which indicates the fraction of 
the binding sites in the specific binding agent occupied 
by the analyte. 

18. The method according to claim 17, wherein tbe 
specific binding agent is divided into between 4 and 40 
locations. 

19. Tbe method according to claim 17, wberein tbe 
locations are in an area of about 10000 A<m 2 , the locations 
being separated from each other by a distance of 100 to 1000 

20. Tbe method according to claim 17, wberein tbe 
fraction of occupied binding sites is determined for a phi- 
rality of different analytes in tbe liquid sample using a 
plurality of arrays on said support. 

21. The method according to claim 17, wberein (i) tbe 
specific binding agent is an antibody and tbe analyte is an 
antigen or (ii) the specific binding agent is an oligonucle- 
otide and tbe analyte is a nucleic acid. 
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ABSTRACT Pairwise sequence companion methods have 

[SHL .F^ prOtei0 * Wh0M •« known 

reliably from their structures and functions, as described in 

f. da i ab «» e fMurxin.A.G,Brenner,S.E,Hubbard,T 
* Chothi. C. (1995) J. Aid. Biol. 247, 536-540). Tn.Vr.iu.: 

Mm le$ ^ d w' Pr ° g ?, ms BLACT lAIUchul, S. F„ Gish, 

yKFllSr* W ' * L,pB " n ' D - J ' (I990 > *«• 
215 403-410), W-BUsn fAltschul, S. F. & Gish, W. (1996) 

Methods Enzymal. 266, 460-480), fasta [Pearson, W. R. & 

Lipman, D. J. (1988) Pnc. Natl. Acad. Sri. USA 85 2444-24481 

£ ^^cK* ^ f - * W » te ™«. M. S. (1981) /. jw! 
A*/. 147, 195-197) and their scoring schemes. The error rate 
of all algorithms is greatly reduced by „si Dg statistical Mores 
to evaluate matches rather than percentage identity or raw 
^ E " Ta,ne statisti «l "ores of ssearch and facta are 
reliable: the number or false positives found in our tests agrees 
well with the scores reported. However, the P-values reported 
by blast and wv-blasti exaggerate significance by orders of 
magnitude, ssearch, Facta ktnp = 1, and wu-blactj perform 
besMnd they are capable of detecting almost all relationships 
between proteins whose sequence identities are >30% For 
mon 'distantly related proteins, they do much less well; only 
one-half or the relationships between proteins with 20-30% 
identity are found. Because many homologs have low sequence 
similarity, most distant relationships cannot be detected by 
any pairw.se comparison method; however, those which are 
identified may be used with confidence. 

Sequence database searching plays a role in virtually every 
branch of molecular biology and is crucial for interpreiing the 
sequences issuing forth from genome projects. Given the 
method s central role, it is surprising that overall and relative 
capabi ht.es of different procedures are largely unknown. It is 
difficult to verify algorithms on sample da'ta because this 
requires large data sets of proteins whose evolutionary rela- 
tionships are known unambiguously and independently of the 
me hods being evaluated. However, nearly all known ho- 
mologs have been identified by sequence analysis (the method 
to be tested). Also, it is generally very difficult to know, in the 
absence of structural data, whether two proteins that lack clear 
sequence similarity are unrelated. This has meant that al- 
though previous evaluations have helped improve sequence 
compar.son, they have suffered from insufficient, imperfectly 
characterized, or artificial test data. Assessment also has been 
problematic because high quality database sequence searching 
attempts to have both sensitivity (detection of homologs) and 
specificity (rejection of unrelated proteins): however these 
complementary goals are linked such that increasing one 
causes the other to be reduced. 

The publication eoiu of this article were defrayed in pan bv page chante 
payment. Thu article mtm therefore be hereby mark J "JJaZSZ 
accordance wuh 18 VS.C. ,1734 solely to indicate thu faci 
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Sequence comparison methodologies have evolved rapidly 

iE?K7 comm01 * "*«»• F« ««mple. parameters ? n 
™„ ( h8Ve Chang ! £t Md ^-BlAST! (2 )_which produces 
SSEL^rtf ments - has become mailable. The latest versiM 

3 0 P™"* 8 * fundamentally different results in the 
form of statistical scoring. 

The previous reports also have left gaps in our knowledge. 

threshSo 6 ' ,hefe h8 \ been "° P ublished «~"« of 
Aresholds for scoring schemes more sophisticated than pe r . 

Thus, the widely discussed statistical scortag 
b««T ? VC ^Jf"""* evaluated on large d*«? 
^ P™™- Moreover, the different scoring themes 
commonly in use have not been compared 

in J h6Se isSUeS ' * ere * a more fundamental question 

in an absolute sense, how well does pairwise sequence com- 

ZT??"?" 1 «• Whit frac,ion ° f "omo.ogous pro'ems 
car. be detected using modem database searching methods' 
In this work, we attempt to answer these questions and to 
overcome bo th of th ^amenta! difficulties that Save bin 
dered assessment of sequence comparison methodologies 
* "* 1 « ° f •? Um evoluti ° I »n' relationships in £ 
scop. Structural Classification of Proteins database (4? which 

t\™» ^ om k s,rUC,,,ral and func,ional characteristics (5) 

n?o!„« 1 PrOVideS 8 ""^'y re,iable "« of hcl 

moiogs. which are known independently of sequence compar- 

b«„ «• We ." Se a " a ^ essmenI m «hod that jointly mea- 
«£!hK Se " s,,,v,t y Md Vocificity. This melhod allows 
2 V d l om ? artson of differen. sequence searching 
procedures. Further, n can be used to aid interpretation of r£l 
database searches and thus provide optimal and reliable 

Previous Assessments of Sequence Comparison. Several 
previous stud.es have examined the relative performance of 
different sequence comparison methods. The most enconv 
passing analyses have been by Pearson (6. 7). who compared 
the three most commonly used programs. Of these, the Smith- 
waterman algorithm (8) implemented in ssearch (3) is the 
oldest and slowest but the most rigorous. Modem heuristics 
have prov.ded blast (1) the speed and convenience to make 
the most popular program. Intermediate between these two 
is fast a (3) which may be run in two modes offering either 
greater speed (ktup = 2) or greater effectiveness (ktup " , ) 
Pearson also considered different parameters for each of these 
programs. 

To lest the methods, Pearson selected two representative 
proteins from each of 67 protein superfamilies defined by the 
P.R diabase (9). Each was used as a query to search the 
database, and the matched proteins were marked as being 
homologous or unrelated according to their membership of pir 

Abbrevialion: EPQ. errors per query. 

Present address: Department of Structural Biology Stanford Uni 
vemty. Farchild Building D-109. Stanford. CA 94 3M 5126 

^«Kdu reqUeS,S Sh ° U,d bC addfeUed - b '"»*'C«. 
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superfamilies. Pearson found that modern matrices and "In- 
scaling" f raw scores improve results c nsiderably. He also 
reported that the rigor us Smith-Waterman algorithm worked 
slightly better than fasta, which was in turn m re effective 
than blast. 

Very large scale analyses of matrices have been performed 
(10), and Kenikoff and Henikoff (11) also evaluated the 
effectiveness of blast and Fasta. Their test with blast 
considered the ability to detect homologs above a predeter- 
mined score but had no penalty for methods which also 
reported large numbers of spurious matches. The Henikoffs 
searched the swiss-prot database (12) and used PROsrre (13) 
to define homologous families. Their results showed thai the 
BLOSUM62 matrix (14) performed markedly better than the 
extrapolated PAM-series matrices (15), which previously had 
been popular. 

A crucial aspect of any assessment is the data that are used 
to test the ability of the program to find homologs. But in 
Pearson's and the Henikoffs' evaluations of sequence com- 
parison, the correct results were effectively unknown. This is 
because the superfamilies in pir and prosfte are principally 
created by using the same sequence comparison methods 
which are being evaluated. Interdependency of data and 
methods creates a "chicken and egg" problem, and means for 
example, that new methods would be penalized for correctly 
identifying homologs missed by older programs. For instance, 
immunoglobulin variable and constant domains are clearly 
homologous, but pir places them in different superfamilies. 
The problem is widespread: each superfamily in pir 48.00 with 
a structural homolog is itself homologous to an average of 1 6 
other pir superfamilies (16). 

To surmount these sorts of difficulties, Sander and Schnei- 
der (17) used protein structures to evaluate sequence com- 
parison. Rather than comparing different sequence compari- 
son algorithms, their work focused on determining a length- 
dependent threshold of percentage identity, above which all 
proteins would be of similar structure. A result of this analysis 
was the hssp equation; it states that proteins with 25% identity 
over 80 residues will have similar structures, whereas shorter 
alignments require higher identity. (Other studies also have 
used structures (18-20), but these focused on a small number 
of model proteins and were principally oriented toward eval- 
uating alignment accuracy rather than homology detection.) 

A general solution to the problem of scoring comes from 
statistical measures (i.e., E-values and P-values) based on the 
extreme value distribution (21). Extreme value scoring was 
implemented analytically in the blast program using the 
Karlin and Altschul statistics (22, 23) and empirical ap- 
proaches have been recently added to fasta and ssearch. In 
addition to being heralded as a reliable means of recognizing 
significantly similar proteins (24, 25), the mathematical trac- 
tability of statistical scores "is a crucial feature of the blast 
algorithm" (1 ). The validity of this scoring procedure has been 
tested analytically and empirically (see ref. 2 and references in 
ref. 24). However, all large empirical tests used random 
sequences that may lack the subtle structure found within 
biological sequences (26, 27) and obviously do not contain any 
real homologs. Thus, although many researchers have sug'- 
gested that statistical scores be used to rank matches (24, 25, 
28), there have been no large rigorous experiments on biolog- 
ical data to determine the degree to which such rankings are 
superior. 

A Database for Testing Homology Detection. Since the 
discovery that the structures of hemoglobin and myoglobin are 
very similar though their sequences are not (29), it has been 
apparent that comparing structures is a more powerful (if less 
convenient) way to recognize distant evolutionary relation- 
ships than comparing sequences. If two proteins show a high 
degree of similarity in their structural details and function, it 
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is very probable that they have an evolutionary relationship 
tnougn their sequence similarity may be low. 

The recent growth f protein structure information com- 
bined with the comprehensive evolutionary classification in 
the scop database (4, 5) have allowed us t overcome previous 
limitations. With these data, we can evaluate the performance 
of sequence comparison methods on real protein sequences 
whose relationships are known confidently. The scop database 
uses structural information to recognize distant homologs. the 
large majority of which can be determined unambiguously. 
These superfamilies, such as the globins or the immunoglobu- 
lins would be recognized as related bv the vast majority of the 
aarity 1 communitv dcs P ilc lh * ^ck of high sequence sim- 

From scop we extracted the sequences of domains of 
proteins in the Protein Data Bank (pdb) (30) and created two 
databases. One (pdbmd-b) has domains, which were all <90% 
identical to any other, whereas (POB4or>B) had those <A0% 
identical. The databases were created by first sorting all 
protein domains in scop by their quality and making a list. The 
highest quality domain was selected for inclusion in the 
database and removed from the list. Also removed from the list 
(and discarded) were all other domains above the threshold 
level of identity to the selected domain. This process was 
repeated until the list was empty. The PDB40D-B database 
contains 1,323 domains, which have 9,044 ordered pairs of 
distant relationships, or -0.5% of the total 1.749,006 ordered 
pairs. In pdbwd-b, the Z079 domains have 53.988 relation- 
ships, representing 12% of all pairs. Low complexity regions 
or sequence can achieve spurious high scores, so these were 
masked in both databases by processing with the sec program 
(27) using recommended parameters: 12 1.8 Z0. The databases 
used m this paper are available from htrp://sss.stanford.edu/ 
sss/, and databases derived from the current version of scop 
may be found at http://scop.mrc-lmb.cam.ac.uk/scop/. 

Analyses from both databases were generally consistent, but 
PDB40D-B focuses on distantly related proteins and reduces the 
heavy overrepresentation in the pdb of a small number of 
families (31, 32). whereas pdbwd-b (with more sequences) 
improves evaluations of statistics. Except where noted other- 
wise, the distant homolog results here are from PDB40D-B. 
Although the precise numbers reported here are specific to the 
structural domain databases used, we expect the trends to be 
general. 

Assessment Data and Procedure. Our assessment of se- 
quence comparison may be divided into four different major 
categories of tests. First, using just a single sequence compar- 
ison algorithm at a time, we evaluated the effectiveness of 
different scoring schemes. Second, we assessed the reliability 
of scoring procedures, including an evaluation of the validity 
of statistical scoring. Third, we compared sequence compari- 
son algorithms (using the optimal scoring scheme) to deter- 
mine their relative performance. Fourth, we examined the 
distribution of homologs and considered the power of pairwise 
sequence comparison to recognize them. All of the analyses 
used the databases of structurally identified homologs and a 
new assessment criterion. 

The analyses tested BLAST (1), version 1.4.9MP, and wu- 
BLast: (2), version 2,0a 13MP. Also assessed was the Fasta 
package, version 3.0t76 (3), which provided Fasta and the 
ssearch implementation of Smith-Waterman (8). For 
ssearch and fast a, we used BLOSUM45 with gap penalties 
-12/-1 (7, 16). The default parameters and matrix (BLO- 
SUM62) were used for blast and wu-blast2. 

The "Coverage Vs. Error" Plot. To test a particular protocol 
(comprising a program and scoring scheme), each sequence 
from the database was used as a query to search the database/ 
This yielded ordered pairs of query and target sequences with 
associated scores, which were sorted, on the basis of their 
scores, from best to worst. The ideal method would have 
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perfect separation, with all of the homologs at the top of the 
I st and unrelated proteins below. In practke perfec^elV,, 
t.on ,s impossible ,0 achieve so instep onet ed 

of r 3 ea ,hrMh0 l d 8b0Ve Which there are numb 
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Errors per query (EPQ), an indicator of selectnl % , he 

by the number of queries. Graphs of these data called 
coverage vs. error plots, were devised to understand how 
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Fig. 4. Reliability of statistical scores in pdwod-b: Each line shows 
the relationship between reported statistical score and actual error 
rate for a different program. E-values are reported for sseaACH and 
Fast a. whereas P-values are shown for blast and wu-blastt. If the 
scoring were perfect then the number of errors per query and the 
E-values would be the same, as indicated by the upper bold line 
(P-values should be the same as EPO for small numberTan^er^ 
at higher values, as indicated by the lower bold line.) E-vaiues from 
ssearch and fasta are shown to have good agreement with EPO but 
underestimate the significance slightly, blast and wu-blast? are 
overconfident, with the degree of exaggeration dependent upon the 
score. The results for pdjmod-b were similar to those for pdWb 
despite the difference in number of homologs detected. This graph 
could be used to roughly calibrate the reliability of a given statistical 

ignored in previous tests but is essential for the straightforward 
or automatic interpretation of sequence comparison results 
Further it provides a clear indication of the confidence that 
shou d be ascribed to each match. Indeed, the EPQ measure 
should approximate the expectation value reported bv data- 
base searchmg programs, if the programs' estimates are accu- 
rate. 

The Performance or Scoring Schemes. All of the programs 
tested could provide three fundamental types of scores The 
first score is the percentage identity, which may be computed 
in several ways based on either the length of the alignment or 
;.c - J 1 ?,, ,he sei > uences - The second is a "raw" or 
Smith-Waterman" score, which is the measure optimized bv 
the Smith-Waterman algorithm and is computed bv summine 
the substitution matrix scores for each position in' the align- 
ment and subtracting gap penalties. In blast, a measure 
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\ K ° ^ ml bi * Thinl is a statistical 
score based on the extreme value distribution. These resuiu 
are summarized in Fig. 1. u 

JSm ■"■J - "^- Th °" gh " has becn ,on ? «»bl*»»«J th« 

ESSf k ""^ * 3 ?° T measure < 35 >- there " * «>nunon 
ruh>of. t humb staung that 307c identity signifies homology. 
Moreover, publications have indicated that 25* identirv call 
be used as a threshold (17. 36). We find that these threshokfc 

results. As databases have grown, so have the possibilities for 
chance alignments with high identity: thus, the reported cWffe 
lead to frequent errors Fig. 2 shows one of the mam pairs of 
proteins with very different structures that nonetheless have 

E& k K ,d , em,ty °T er consid ^ble aligned regions 
Despite the high identity, the raw and the statistical scores for 
such incorrect matches are typically not significant. The prin- 
cipal reasons percentage identity does so poorlv seem to be 
that it ignores information about gaps and about the conser- 
vatrve or radical nature of residue substitutions 

From the pdbwd-b analysis in Fig. 3. we learn that 30% 
.dentity is a reliable threshold for this datable onlv for 
sequence altgnments of at least 150 residues. Beca^one 

, ! f 1 ! ? " ° f P™ 6 "?* h8S 4 «* «««*y over 62 residues, 

it is probably necessary for alignments to be at least 70 residues 
.« i length before 40% is a reasonable threshold, for a da7ab« 
of this particular size and composition. 

At a given reliabUity. scores based on percentage identirv 
detect just a fraction of the distant homologs 8 Lundg 

he EH* 0 ™ 8 - " T meaSUrCS ,he Pontage identity Z 
he aligned regions w.thout consideration of alignment length, 
hen a negligible number of distant homologs are detected 
Use of the hssp equation improves the value of percentage 
identity, but even this measure can find only 4% of all kno4 
homologs at 1% EPQ. In short, percentage identity discards 
most of the information measured in a sequence comparison 
Kaw Scores. Smith-Waterman raw scores perform better 

notable benefit in our analysis. It is necessary to be very precise 
when using either raw or bit scores because a 20% change in 
cutoff score could yield a tenfold difference in EPQ. However 
it is difficult to choose appropriate thresholds because the 
reliability of a bit score depends on the lengths of the proteins 
matched and the size of the database. Raw score thresholds 
also are affected by matrix and gap parameters 

Statistical Scores. Statistical scores were introduced partly 
to overcome the problems thai arise from raw scores This 
scor.ng scheme provides the besl discrimination between 
homologous proteins and those which are unrelated Most 
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likely, its power can be attributed to its incorporation f more 
uiformation than any other measure: it takes account of the 
fall substitution and gap data (like raw sc res) but also has 
detaih about the sequence lengths and composition and is 
scaled appropriately. 1 

We find that statistical scores are not only powerful, but also 
easy to interpret, ssearch and fasta show close agreement 
between statistical scores and actual number of errors per 
query (Fig. 4). The expectation value score gives a good, 
slightly conservative estimate of the chances of the two sc-' 
quences being found at random in a given query Thus an 
E.valueof0.01 indicates that roughly one pair of nonhomoio£ 
of this similarity should be found in every 100 different queriel 
Neither raw scores nor percentage identity can be interpreted 
m this way and these results validate the suitability of the 
extreme value distribution for describing the scores from a 
database search. 

The P-values from blast also should be direetlv interpret- 
able but were found to overstate significance by more than two 
orders of magnitude for 1% EPQ for this database. Nonethe- 
less these results strongly suggest that the analytic theorv is 
fundamentally appropriate. wu-BLAST2 scores were more're- 
Hable than those from blast, but also exaggerate expected 
confidence by more than an order of magnitude at 1% EPO 
Overall Detection or Homologs and Comparison of Algo- 
rithms. The results in Fig. SA and Table 1 show that pairwise 
sequence comparison is capable of identifying only a small 
traction of the homologous pairs of sequences in pdb«d-b 
Even ssearch with E-values. the best protocol tested, could 
find only 18% of all relationships at a 1% EPQ. blast which 
identifies 159c, was the worst performer, whereas 'fasta 
Ktup = l is nearly as effective as ssearch. fasta ktup = 2 and 
WU-BLAST2 are intermediate in their ability to detect ho- 
mologs. Comparison of different algorithms indicates that 
those capable of identifying more homologs are generally 
s ower. ssearch is 25 times slower than blast and Lte 
slower than fasta ktup = 1. wu-blastz is slightly faster than 
fasta ktup = 2, but the latter has more interpretable scores 
J« r J?^ D ;*- WhC l e ,h f re are many dose «'*ionships. the 
h?m„£o °fr Can ,^ e "^ y °" ly 38% of "rurally known 
homologs (Fig. SB). The method which finds that many 
re ationships is wu-blastz Consequently, we infer that the 
differences between fasta kup = l. ssearch. and wu-blast> 
programs are unlikely to be significant when compared with 
variation in database composition and scoring reliability 

rig. 6 helps to explain why most distant homologs cannot be 
found by sequence comparison: a great many such relation- 
snips have no more sequence identity than would be expected 
by chance, ssearch with E-values can recognize >90% of the 

Pa f Wi , h 3 ° _40% idemi, y- ,n ,his "Si 0 "- 'h"e 
f F n P ? homologous proteins that do not have sign if- 
•cant E-values. but 26 of these involve sequences with <50 
T75\.? f se< ' uences navin 8 25-30% identity. 75% are 
hSlStlS SSEARCH E " Values - However - a,thou 8h 'he num- 
Mk J< T P T 1 ? ,owcr levels of iden,itv - 'he detection 
falls off sharply: only 40% of homologs with 20-25% identity 
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1 i. ? u,nou «'«»> «nd detection of homologs in pdbod-b n,„ 
show ,he distribution of homologous pairs ,3 «c«S?« £ 
identity (using u, e me „ U re of identit Z bZf^Kte*~£yV he " 
the number of these pairs found bv he bl^ a ,,h, 1°"* md,e * ,e 
(ssearch with E-vaiues) aTl% EPO Tlhl £2? """-hing method 
proteins with <40%"dem tv and „ th ?^ B d ? Wbi5e coma,ns 

25 /o or more identity, but m detection wanes sharpiv beiow 25% 

panw.se sequence comparison to detect then..' ' " 

are detected and only 10% of those with 15-20% can be found 
These results show , ha , statistical scores can find reUted 
' demiTy - " r arkab ' y ,0W: however the^wer 

Bl^rV^ 1 "^ ° f thiS WOrk ' 3 new vereion ° f P^ise 
BLAST was released: BLASTGP (37). It supports gapped align- 

WU - B ^" and d «P e ^« with sum statics. Our 

E^v fu«are°r e f^f TGP HT g dCfaUh P ara «"«ers show , ha , ,ts 
t-vaiues are reliable and that its overall detection of homoloes 
was substam.ally better than that of ungapped blastTu, nm 
quite equal to that of wu-BLast:. 

CONCLUSION 

The general consensus amongst experts isee refs 7 24 25 V 
and references therem) suggests that the most effective 'se- 

nThT made bV ( " Usin * a lar * e curre nt database 

in which the protein sequences have been complexity masked 
and («) using stat.stical scores to interpret the results Our 
experiments fully support this view. 
Our results also suggest two further points. First, the E-va|. 

estimates of the significance of each match, but the P-values 
prov.ded by blast and wu-blast; underestimate the true 
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extent of errors. Sec nd, ssEarch, wu-blastz and fast a 
ktup = 1 perform best, though blast and fasta ktup = 2 
detect most of the relationships f und by the best procedures 
and are appropriate f r rapid initial searches. 

The homologous proteins that are found bv sequence com- 
parison can be distinguished with high reliability from the huge 
number of unrelated pairs. However, even the best database 
searching procedures tested fail to find the large majority of 
distant evolutionary relationships at an acceptable error rate 
Thus, if the procedures assessed here fail to find a reliable 
match, it does not imply that the sequence is unique; rather it 
indicates that any relatives it might have are distant ones.** 

"Additional and updated information about this work, includiiu 
supplementary figures, may be found at http://sss jtanford.edu/sssA 
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The phenotypic differences thai distinguish the 
various kinds of cells in a higher eukaryote are 
largely due to' differences in the expression of 
genes that code for proteins, that is. those tran- 
scribed by RNA polymerase II. In principle, the 
expression of these genes might be regulated at 
a ny one of several stages. The concept of the 
-level of control" implies that gene expression 
is not necessarily an automatic process once it 
has begun. It could be regulated in a gene- 
specific way at any one of several sequential 
steps. We can distinguish (at least) five poten- 
tial control points, forming the series: 

Activation of gene structure 

I 

Initiation of transcription 
I 

Processing the transcript 
i 

Transport to cvloplasin 

I 

Translation of mRNA 

The existence of the first step is implied by 
the discovery that genes may exist in either of 
two structural conditions. Relative to the state 
of most of the genome, genes are found in 
an "active" stale in the ceils in which they 
are expressed (see Chapter 27). The change or 
structure is distinct from the act of transcrip- 
tion, and indicates that the gene is "transcrib- 
able." This suggests that acquisition of the 
"active" structure must be the first step in gene 
expression. 

Transcription of a gene in the active state is 



controlled at the stage of initiation, that is, by 
the interaction of RNA polymerase with its pro- 
moter. This is now becoming susceptible to 
analysis in the in vitro systems (see Chapter 
28). For most genes, this is a major control 
point: probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukary- 
otic cells, for example, via antilermination 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by polyadenyla- 
tion at the 3' end. Introns must be spliced out 
from the transcripts of interrupted genes. The 
mature RNA must be exported from the nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at the level of nuclear 
RNA might involve any or all of these stages, 
but the one for which we have most evidence 
concerns changes in splicing; some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type of pro- 
tein product (see Chapter 30). 

Finally, the translation of an mRNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of this mechanism in 
adult somatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed to involve the block- 
ing of initiation of translation of some mRNAs by 
specific protein factors. 

But having acknowledged that control of gene 
expression can occur at multiple stages, and 
that production of RNA cannot inevitably be 
equated with production of protein, it is clear 
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that the overwhelming majority of regulatory 
events occur at the initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter element that is recognized 
by a regulatory transcription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
HSE (heat shock response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in Table 29.1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for a short distance on either side of 



Table 29.1 Inducible transcription factors bind to 
response elements that identify groups of promoters 
or enhancers subject to coordinate control. 



Regulatory Agent Module Consensus 



Factor 



Heat shock HSE CNNGAANNTCCNNG HSTF 

Glucocorticoid GRE TGGTACAAATGTTCT Receptor 

Phorbol ester TRE TGACTCA AP1 

Serum SRE CCATATTAGG SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoint, but are usually <200 bp upstream 
of it. The presence of a single element usually 
is sufficient to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in P r0 * 
moters or in enhancers. Some types of elements 
are typically found in one rather than the other, 
usually an HSE is found in a promoter, while a 
GRE is found in an enhancer. We assume thai 
all response elements function by the san* 
general principle. A gene is regulated fc v 1 
sequence at the promoter or enhancer that • 
recognized by a specific protein. The P^ 1 ^ 1 
functions as a transcription factor needed J° 
RNA polymerase to initiate. Active protein * 
available only under conditions when the £ enC 
to be expressed; its absence means that 
moter is not activated by this particular & rr 

An example of a situation in which 1111 ' 
genes are controlled by a single factor * s ^ n ,. 
\ided by the heat shock response. This is j 
mon to a wide range of prokaryotes 1 ^ 
eukaryotes and involves multiple contr° 
gene expression: an increase in temp** (>Il 
turns off transcription of some genes, * ur ^ 
transcription of the heat shock S etkeSf ^K t \f» 
changes in the translation of gira |f!i 



causes 

The control of the heat shock genes 



the differences between prokaryoti c ^ ^ 
eukaryotic modes of control. In bacteria* ^4 
sigma factor is synthesized that direc* s ^ (ef . 
polymerase holoenzyme to recognize an 



